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ABSTRACT 


This thesis suggests p. new ’Ouory lan.^'uage' for 
infor’^aation retrieval from a Eelntional Bata Model. It 
also shows the implcmentability of the language. All the 
system programs have boen written in PASCAL, because of 
the excellent chara.ctcr manipulation capabilities of the 
language. The system has hern successfully implemented 
on EEC 1090 System available in the Computer Centre of 
Indian Institute of Technology, Eanpur. 



CHiPTilS I 


INTRO DUG TIOi'' 

1 1 Data Base Managt^mPnt Syst ora i 

DATA, os usually conceived of , is oo'ic collection of 
information, gathered from sorao^^hero , with some intentions - 
may he clearly defined hut not usually so. If allowed to 
remain in its crudes 1; form, the dat^ is of no jpractical use 
to anyone, even if the purpose for which it woe collected was 
well defined. It has to he converted to a foj'iC such that it 
can he comprehended, partly if not wholly, hy anybody who 
wants to make uso of ib. By converting it to some such form 
and storing it, may he in tho secondary memory of a computor, 
one is moving towards, what we ca3-l as 'building a data base' 
But a lot more needs to he done. How to y-'b the desired 
information out of thic stored structure like .iagnetic disk o 
tape? Some methodology has to he adopted for the purpose. 
This methodology, technically known as ' Da,ta Prise Management 
System' , should be such that a user can construct his own 
'information need statements' and interrogate the data base 
without having much knowledge of tho system. Of course, he 
is expected to know what data is stored in the system which 
he wishes to interrogate. (Otherwise what is h^^ looking for 
aniA^ay ) . 
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Tho user of r, data base could b>-' anyone. Por example, 
in an industrial organization, be could bo o,uyonc right 
f-’ora the foreman at the shop floor level to tho top managers 
0 '‘'‘ the organization. Each of those persons has a, different 
vjap of thinking and expres-'int his needs. Whereas a computer 
system, like the DEi''IS, roo'uireo constructs in a very formal 
language before it can cater to the needs of the user. 

1.2 Motivation .’ 

In this thesis, an actompt has been mode at describing 
a nev language which enafulec one to constroct Queries to 
interrogate a Eslational Data Base. The stai om'^nts in the 
language are cuch that, with tho knowledge of the S 3 rntax of 
the I'inguage, they come naturally from the word statement of 
the au')ry. The idea is taken from 'Decisio:.' Tables' because 
it is felt that any query can be naturally split into two 
parts - a condition part and an action pari,. Moreover, the 
concept of a decision table is easy to understand. 

A Relational model has beeji chosen for the data base 
because of its undoubted conceptual superiority over the 
other models like the Wetwork model 3.nd^ tho Hierarchical 
model. It is assumed that the relations stored in the data 
base are in tho Third normal Porm. 

The motivation behind this wo'*’'k has come from the fact 
that, in general, the user of any data base will not be a 
computer professional. Any occasional user (of a probably 
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large data base) must "be able to satisfy his information 
nood without having to hire a computer professional for 
tho purpose^ In other words, this thesis suggests a Higher 
level Language with the idea that the user will find it easy 
to program his queries in tiiis language* It also makes an 
r,ttompt at showing that the larguago is complete and it 
can be implemented on any of tie present day big computer 
systems. Examples have been taken from Codd's paper [4] 
to show t'let the language can r.;trievo information for all 
sorts of complex but meaningful q.ueries. 

1*5 Organization of tho Thesis ; 

After a brief introduction in Chapter I, a study 
of tho present relational d ^ta baso systems has been done 
in Chapter II, Chapter II' doscribes the new proposed 
language. It shows how tho diCf^,'ront constructs have been, 
developed and gives the full syntax details of the language. 
Chapter IV is devoted to the implomontation d- tails, which 
include processing of oiAorios in hhe language and converting 
them to tho relational algebra OTprossions. Algorithms for 
implementing the different relational algebra operations 
have boon given in Chapter V. Thu concluding chapter 
summarizes tho work done and indicates the scope for 
improving the system. 



CHiPTES. II 


PRSSEFT STATE OP ART 


2.1 T lic ’Relational Mode l' : 

Ttie irolottional model of d.a,ta taso was inti' educed 133?' 
-a*P, Codd of the IBM Research Laboratories in 1970. Ho was 
particularly interested in providing tho user of iho data 
with a data model whioh would be independent of tho various 
implement a. t ion considerations. He tras also interested in 
making available to the user e high-level language, which 
would be non -procedural in nnture, for accessing the data. 

In his first paper (ref^ [6]) he says that, 

'The relational model of data provides a means of 
describing data with its natural structure only - that is 
without superimposing any additional structures for machine 
representation purposes, iccordingly, it provides a basis 
for high-level data language which will yield maximal indepen- 
dence between programs on tho one ha.no and machine representa- 
tion and organization on the other, A further advantage of th 
relational view is that it forms a sound basis for treating 
derivability, redundancy and consistency of relations' . 

A relation is defined in mathematics as follows; G-iven 
sv,ts m, D2, ... Dn (not necessarily distinct), R is a relatio 
en these n sets if it is a set of ordered n-tuples dl,d2 ,.,.dn 
such that dl belongs to Bl', d2 belongs to D2, and so on* 



S:;t;p Dl, D2,.*,Dn ar-o called th* dociains of fl, Tho valuo n 
is callGd tho d;groc of E, 

It is conveniont to r pjosont a relation in tho form 
of ^ table, '■■hars each row represents one n-tuplc. The relation 
'Port' shorn in Eigurc 2.1 cons?Bt3 of 6 tuples. 


Pig, 2.1 2 A typical rtlation. 
PART 


p ^ 

PNAM5 

COLOR 

UUIGITT 

OOH 

1 

NUT 

RED 

12 

4 

2 

BOLT 

GRHIEN 

17 

7 

3 

SCRBN 

BLUE 

17 

3 


SCRBV 

RED 

14 

9 

5 

CAN 

BLUE 

12 

2 

6 

COG 

RED 

19 

1 


It is obS'-^ived that a rwl ution has tho following 
properties ; 

(1) ITo tvjo rows arc identicn.1, 

(2) Ordering of rows is insignificant j, 

(3) Ordering of columns is also insignificant , 

(4) Every v-oluc within a relation is o.n atoiiic data item. 

The last property is true- in c-aso tho relation is in the 
'Third Normal Form' (Ref. [bj). 

It was pointed out by Codd that the relational operation 
defined in m^ithematics can be applit'd on these relations. He 
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showed how to carry out the operations like projection and 
join on these relations. 

Most existing systems arc not relational as a natural 
outcome of the way in which computing technology has itself 
developed. The earlier computer systems had comparatively 
small capacity and high accoss time. Traditionally, the 
emphasis was on sequential media like tapes and cards. With 
modern hardware and techniques, it has become possible to 
design and implement more largo scale systems based on the 
relational approach. 

2 Data Sub Languages for the Rolational Model : 

For query purposes in a relational model, a data sub 
language (DSL) should allow the user to specify the relatior 
he wants retrieved from the data base. It is to bo pointed 
out here that any information that a user obtains from the 
data base will be in the form of a new relation which will 1: 
derived from the original relations stor.''d in the system. 
There are two ways in which a user can specify the new 
relation he wants retrieved from the da' a base - 

(1) A non-procedural method, in which ho xTOuld simply 
state the definition of the desired result and leave it to 
the system to determine the xequirod opera tiors. 

(2) A procedural method, in which ho would actually spec; 
the step-by-step operations to be performed to produce the 
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dGsired rosuli:. (The operations here refer to the relational 
operations - join, prooect and divid'). 

2.2.1 tiodd ' s DSL ALPHA, is a non procedural ( c?„lculus based) 
language. It is data indep. ndont in that it contains 
absolutely no references to ctorago/acc ss d -tails such 

as indexes, pointers etc. The language is complete. The 
complexity of a statement in DSL ALPHA is in direct propor- 
tion to the complexity of hie operation the user is trying 
to perform, 

2.2.2 A procedural language (algebra based) also provides 
full data independence. It is a reasonably simple langap.gc 
but probably less simple tha.n DSL ALPHA. Compared to most 
existing languages, which tend to be very low level, the 
algebraic language can bo called to be non-pTOCOdural. Hut 
compared to DSL ALPHA it is definitely more procedural. 

2.2.3 As an illustra,tion of the typos of statements in the 
two languages mentioned above, let us take the following 
query based on the rel'^tions given in Pig. 5.1. 

Word Statement of query ; 

Pind the part names of ports supplied by supp3-ier 
' Jono s ' . 

The query as converted to o ne in rel ?.t ional algebra ; 

JOIN SUPPLIER and SUPPLY (over S# )j 
JOIN the result and PAPT (over P ^ )? 

PROJECT the result by PNAME whore SNAJ® = 'Jones’. 
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The query as convertod to ono in r clationn l c alcul^ie ? 

GET W P.PJfAME : 3 GUPP1Y((PART.F# = SUPPLY. P#) 

A ( SUPPLY. = SUPPLIER. S#) 

A ( SUPPLIPP . SMME = ’ JONi'S ’ ) ) . 

Por the sake of simplicity it is essential that nho 
language made a,vailable to the user non-prooodural. The 
DSL ALPHA, would he a good candidate for the same. But 
vrriting queries in DSL ALPHA requires the knowledge of 
'first order predicate calculus'. Symbols like 'J ' (there 
exists), 'V" ' (for all) and (such th-^t) tend to scare 

away the user of this language. 

2 • 5 Existing Systems ; 

There have boon auito a lot of small rclrtional systems 
in the recent past. But most of them do not insist on any 
particular level of normalization. They arc enntont with 
the relations being in the First Normel Porn, Obviously a 
data model in the IHF can not exploit most of the potentiali- 
ties of a relational data base. 

2.3.1 MacAIMS appears to bo the earliest example (1970-71) 
of a system providing both a, full I'clational model and a 
language (based on algahra) which actually views the d\ta 
in its relational fom. The lo^nguago, however, is not com- 
plete, in the senso that not all relations derivable from 
the data model by means of first order predicate can, he 
retrieved in this language. Ono interesting feature about 
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MacAI^-o is that tho structure may vary from r^^lation to rclatio] 
(thus allowing eech relation tc ho stored in xho fomi most 
suited to it). For details of tho Irnguagc rjfer to [ 8 ], 

2 . 3.2 MOr.I>S ( 1972 ) incorporates some of tho j-dea,s of 
MpcAIT'IS. In it, howGver, tlic user's vi':/ of "h^j date may 
include unnormalizod relations, which arc definvod in terms 
of tho fundamental normalized relations in the data model. 

Thus the user is provided with a limited form of data suh- 
model. Though the language is hasod on relational calculus, 
yet it is (also) possible to perform a retrieval using 
algebraic operations. For further details refer [9]. 

2.5.'^ The IS/l system (1972) also provides a limited form 
of data submodel. An 'implied jolationship' is defined by 
a set of (algebraic) statononto of this language which 
specify how other relations are to joined, projected, and 
otherwise maniprlatod to produD o the ronuirod result. Tho 
first time the user refers to this relation, tho definition 
is invoked and the desired relation is gc-neraied. Subsequent 
operations are performed directly on tho geneiated relations. 
This system developed some useful optimization techniques 
for evaluating algebraic ejcprossions. For further details 
of this language refer to [ 10 ]. 

2.3.4 The SEQUEL prototype was originally developed as a 
single user 35 ^ 8 tern to demonstrato the foasibility of support- 
ing the SEOUBI language (1974). This language is complete in 



10 


the sensG that any relation, dorivatlc from the dnta model 
by nca-ns of first order predien,to, can bo derived in tills 
language. It does not involve symbolir.m of the predicate 
colculus (i.o., there are no qugntifiors'' . It iis based 
on a construct known as 'mapping' which may be viewed as a 
subsetting followed by a projection. lie. syntax is quite 
hnglish-like. As an example the query above may be written as 

SELECT PEAlffi EEDW PAST 

TOEPE P # = 

SELECT P ^ PR0T4 SUPPLY 
WHERE S # = 

SELECT S # PROM SUPPLI''dR 
WHERE SHAME = ' JOUES' . 

However, this system has boon extond'^d by the IBM Cambridge 
Scientific Centre and the MIT Solan School Energy laboratory 
to allow a singlj type of concurrency end is being used as a 
component of the G-eneralized Management Information System 
(GMIS) being developed at MIT for energy related applications 
For further details on the language ref-jr to [7] and [11]. 

Of the systems given above, MacAIMS is not complete 
and MORIS and IS/l present to the user a data submodel which 
has unnormalizod relations. SEQUEL is a language xrhich is 
complete and without Quantifiers. But the id 'is of 'mapping' 
and 'recursion', which are made use of in this language, are 
not very simple to grasp by an ordinary user. Most of the 
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langUfigGis of tho ■<:;ype SEQUEIj though avoid the uoo of quanti- 
fiors of the relational calculus, require tho user to evolve 
r method of generating the output relation. Like in the 
above example, whore the constraint is on the supplier name 
and the attribute required in the output relation is partnamo, 
the SECITEI query clearly specifics ho‘.ir tho ’.SUPPLY' relation 
is to b'^ used tec reach tho 'PART' relition starting from the 
'SUPPLIER' relation. 

The language we arc looking for should be reasonably 
complete (equivalent to DSI ALPHA) oJid should also not make 
use of any quantifiers. Apart from this, it should not 
involve any difficult concepts. As in tho above example, 
we should not expect tho user to know how tho path is to 
be traced while executing his query. If need bo, ho should 
bo expected to know only this much that the relation 'SUPPLY' 
will bo involved in getting for him the output relation he 
requires, and not how it has to be used. 



CHA.PTj2P III 


PORMUIATION OP LANGUAGS 

The basic id^a behind cpccifyino; this language- is 
that a query can be split into s condition part and an 
action part. Then it shoiild bu possible to enumerate all 
the poSvSiblo typos of conditions and actions. Pinally it 
should bo possible to combine those types of conditions 
and actions -which require the same type of manipulatiOiis on 
tho relations. 

3 . 1 Parts of a Qu ery : 

First we show that ajiy query can be split into a 
condition and action. Thv.n wo go on to studying these two 
parts separately, let us start with considering a few 
examples (refvsr to the relations in Pig. 3-1)" 

fucry Find ‘the part name and quantity-on-hand of parts 

whose quantity -on-hand is less than 5- 

Clearly the condition on all tuples in the output 
relation is that the quantity-on-hand for the part be less 
than 5. There is no other constraint that one can obviously 
see. The action part is that it is roa'iirod to get into th 
workspace the part names and the quantities-on-hand . Thus 
the query can be written as - 
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Supplier 





S# 

SI'TA.M.J 

IOC 


1 

SWITI-T 

TDT'TRO” 


2 

JOFES 

PATTIS 


3 

I, A. ICE 

PA.RIS 


4 

clack: 

LOFDOF 


5 

ADAMS 

ATB^l'TS 

laj 

't 



P# 

PFAT'IE 

COLOR '*r 

SIGHT QOH 

1 

I'TUT 

RED 

12 4 

2 

■^OLT 

GREEF 

17 7 

3 

SCRESr 

BLUE 

17 3 

4 

SCREW 

RED 

14 9 

5 

CAM 

BLUB 

12 2 

6 

COG 

RED 

19 1 




J# EGB-m 


1 

SORTER 

24 

2 

PITHCH 

11 

3 

READER 

17 

4 

CONSOLE 

9 

3 

COLLATOR 

27 

6 

TERMIFAL 

18 

7 

TAPE 

13 



Jtet^'Base 
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Concf'iGion; Quan tit y-oii- hand of part less than 5. 

A.ctiont Get part name rnd ouantity-on-hand into the 

work space. 

let us look at another exa'''’pl'' 

Quo r e Find the names and loc-^tions of all suppliers, each of 
whom supplies all jols. 

The condition hero is that all the jobs be supplied. By 
all jobs it means that the jobs, listed in the relation JOB, 
be troato.d as ono entity. The action required is to get into 
the workspace the names and locations of suppliers. Thus the 
query can be written as: 

Condition: ill jobs. 

Action: Got into the workspace, namo and location of 

suppliers. 

Let uj £0 on to look at another slightly complex query: 

Query Find thi- name and location of all suppliers who supply 
atlcast those jobs supplied by supplier 'SMITH'. 

This query being slightly cnmplicatod , has to be split 
into two s nailer queries. 

Sub Query A Find the job numbers of jobs supplied by supplier 
' Smith' s 

Sub Query B Find the name and location of suppliers who suppl 
atlcast those jobs whoso job numbers arc in the 
wo rk sp ac e abo vo . 
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This two stop query cph bo split into :'onr'xtion and 
actii/n as follows; 

Condition; Supplier name 'Smith’ 

Action: Get into workspace "Che job numbers siipplied. 

Condition; All job numbers in the above workspace. 

Action: Get 'into the final workspace supplrer names and 

locations. 


From the throe examples that wc hovo consid'red, it 
seems that splitting a query into its condition and action 
components is not difficult. 

5 . 2 ^''rorkspace ; 

It can bo noted that the concept of workspace is import- 
ant. Workspaces have the following features; 

(1) They arc not part of fchr data base itself^ so that tiiere 
will bn no inconsistency due jo the creation cf redundant data. 

(2) The work spaces b long to one mdividual user or process, 
so that the manipulations of data within such 'trorkspaccs do not 
conflict with other users. 

(3) A naming scheme is employed to distinguish relations 
and attributes within the work space from the relations and 
attributes in the data base. 

Mow -we look at the condition jpai't in more details. 
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3 . '5 Condition Part ; 

This is essontiaHy tho sarao as the p> -dicate in 
DSl ALPHA. Lika a predicata, a condition uay consist of an 
expression of axhitrary coinplGnity, for'iulated according to 
tho usual rules. The ■oormitted operators ax'e the comparison 
operators * = ', . ' < S the Boolean operators 

'OR', 'AHL' , and 'HOT' and of course parentheses '(' and ')' 
to enforce a desired order of evaluation. 

The condition part of acy query con be reduced to a 
normal form: 

C = (Oil AI^TD 012 AND . . .AHD CIH^) 

O; (021 AND 0 22 ... AND O 2 N 2 ) 

OB (Ol'Il AND 0M2 ... AND OIIN^) 

where Oij^ is a oi mp le c o n di b 10 n v;ith on.' comparison operator. 
Tho result of a general nucry, with condition pai't as 'C, 
c'^n be obtained by applyiug 'union' and 'intersection opera- 
tions on tho results of individual auories with simple condi- 
tion parts. For example, the final result for condition 'O' 
will be given by, 

H = (RlinR12 ...fliaNT)U (B2lf^R22 ... AR 2 N 2 ) 

. . . U (RMl/^ RII2 ... n RI'DTit) , 

where Rii is the relation retrieved by a query with the 
(simple) condition part Cio^j^. The system is designed for a 
general condition of the above type. For explanation purposes 
however, we will consider only queries w.ith simple conditions. 
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How let uo enumerate the (different; tyj es of possible 
conditions reai^-ured to incorporate all types of oueries. Here 
it is to be pointed out tL-.t 'fiort has been mad o to include 
all t5rpes of conditions that ui'’ encountered in queries. But 
the systs’i can be? easily modified fer inol.udin'a, any further 
t'/pes that do not fall in ao / of ‘‘.he tAn;>es that are described 
here. This v:ill be- evident from the structure of the system. 

Given below is the list of seven different types of 
conditions. in example query lias been given with each for 
explanatory purposes. 

(1) 'The value of an attribute in a j elation is equal (or 
unequal) to a specified valuo. Por example - 

C'uein/ 1 °. Get the part name and quantity- on-hand of 3 parts 
for which the quantity-on-hand is less than 5. 

(2) The value of an attribute in a relation is ooual (or 
unequal) to (a) the n-th largest V-ilue foT that attribute in 
that relation-j (b) tho n-th small? st value for that attribute 
in that relation. Por example - 

Ouory 2 ; Bind the part number and part nemo of parts for whicl 
qua-ntity-on-hand is tho largc^'t. 

(3) Something is required to bo done for each of the 
values of an attribute in a pa.rticular i elation. Por example - 

Query 3 % Bind in the increasing ordo;’ of quantity-on-hand, 
the part name and quantity-on-hand of parts being 
supplied . 
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(4) In a relation, tht: number of tuples, )ith same values 
for some attri’ atos and difforont *70 lues for somo other 
attrihutes in that relation, is eoual (or un?o,ual) to a 
specified number. for example •- 

Query,.,^'’; ~ G-ot the mu.foei'- of parts b :inp supplied to more than 
two jobs. 

(5) In a, relation the toto.l of pni attribu.'C, with same 
vad-ues for some other attributes in thit relation, is equal 
(or unequal) to a specified v,o,luo. for exoimplo - 

< "'Uory 8 ; for those parts, fo*" which the total quantity being 
suppli‘='d to a job is grca"er than 5 , got, as a 
trjple, part number, the job number and the total 
quantity of that pai’t being supplied to that job. 

(6) All tho different values of an attribute in a relation 
ore to bo tre<^tod as one entitj^, I'or example, -■ 

Query 6 " G-ct the supplier namf^s and locaticr. s of suppliers 
who supply to a,ll jobs. 

(7) I'ho valuo of an attribute in one rol':)“ion is not 
present in tho values of that attribute in another relation, 
Por example “■ 

Query 7 ^ lot tho name and location of those suppliers who do 
not supply to any job. 

now construct statements of tho language for these 
conditions. It can be noted that the comparison operator for 
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conditions of the types and 5 could be any of the six, 

i.e,, =,< >, <, > , or > = . Por condxtiojis of the type 5 

and 6, only the oouality ( = ) oper'^tor is valid and for condi 
tion of the typo 7, only the 'not eouo-l fco ’ (< >) operator 
is valid « The syntax for th:sa eimplo conditions is as folll.ox"s- 

(1) ■poDation nano, xttribut' nam.. <0‘(h'r' > '' value'' , 

The condi l.ion for example Query 1 becomes ~ 

PAhT.OOH = " 0 " 

(2) (a) bolation name , at tribut o nam* <C0Tjr'>n TOP 
(h) Bolation name, at tribute name <C0>TP>n BOTTOM 
The condition for exaraplo Query 2 bacomos - 
PA.PT.OOI = 1 TOP 

(9) Relation nrme.n-ttribuco xiamo = * 

The condition for oxam-nlo Quor.y 7 becomes - 
SUPPLY. 'B H = Hi: 

(4) COUNT (relation name SAMT sittribute nrmos DIPPERENT 

attribute nones) <00141) > positive integer 
The condition for example Query 4 becorass - 
COUNT (SUPPLY same P# urSPEUEFT J # ) > 2 

(5) TOTAL (relation name SAP^E attribute names i attribute 

n^’me) < C0ND> value 

The condition for example Query 3 bocomt.n - 
TOTAL (SUPPLY SAME B if , J# s QTY) > 5 

(6) Relation name, at tribute name - A.LL 

The condition for example Query 6 becomes - 


JOB. J # = ALL 
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(7) Rolation namo.attri'butu iia,nie <> relation name, a bt ribute naiae 
The condition for exaiUT'le Qu-^ry 7 bocom'^8 — 

S'jj'Tiirh.s# <> sxjppiy.Sf/ 

Here '<>' is th oymbol used for 'not ooual to'. The rooorvcd 
viords of the I'^nguago arc given in capit*'! letters. <0010) > 
rcf''r9 to any of the six comparison operators. 

he li-’V''. considorod only quer-ioo with a simple condition 
part. Now wo give examples of two more quorios which have a 
ge.-nGral condition part. 

Qu ery 8 ; Get the nfimcs of those suppliers who supply to job 
number 4 and are located in Athene. 

The condition for example query 8 bocomos - 
SUPPLIJ®.L0C = " ATHIHS" AFT SUPPLY. J P= ”4" 

Query 9 » Got the names and locations of suppliers who supply 
a blue pa-^’t or a grocn part. 

The condition for the example qu •'I'^y 9 becomes - 
PART. COLOR = " Blun" OR PART.COLOL = " GEBUiT" 

5*4 Action Part : 

This part of tho query just indicates the domains (i.e, 
attribute names) to ho brought into the worh spn-co. It is 
ana.logous to the 'target lisy of LSI ALPHA. Apart from speci- 
fying the attribute names (qualified by tho appropriate rolatic 
names) to bo retrieved, it might (also) bo rcouirod to do the 
totalling of a, certain field heforo displaying (example 
Query 5) or it might be required to get the count of the numbe: 



21 


of tuples ( thnt satisfy the ccndition) in the output relation 
(Example Query 4). Sometimes one might want to retrieve only 
a limited number of tuples in the output relation instead of 
getting a long list (Example Ouory 1). At other tines, one 
night want to have the output rel^-tion sorted according to 
the values of certain domains (Example iucry t). All those 
desired results are incorporated in the syvstcn and tlio syntax 
for tho action statements is defined as follows- 
GET relation name. attribute name 
The action part of example Query 2 bee omes - 
GET PART.Pi^ , PART. PEA 'iffi 

or GET TOTAI'( relation name SA^IG attribute names sat tribute nam 
or any combination of tho above two. It is assumed that the 
domain, if any, of which tho totalling is required is tho last 
in thu output list. The a.ction patt of example (~'uery 5 bocomos- 
GET PART.P^ , SUPPLY. J# , TOT AL( SUPPLY SAP® P# ,J# ;QTY) 

For getting only a limited number (n) of tuples in the 
output relation, tho syntax is - 
GET (n) 

The action part for example Query 1 becomes - 
GET (3) PART.PIAI®, PAET.QOH 

For getting only the count of the numb''r of tuples in 
the work space, the syntax is - 

GET (coum) 

Tho action part for example Query 4 hocomos - 
GET (COURT) SUPPLY. P# 
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For getting the output in a sorted order, aftur aii 
the output list, we add the information as follows - 

UP 

get rolacion noiov^ , attrihuto name) 

Tho action part for erampl Siuer;r p hoc ones - 
GET PAGT.FFkir, PART.'" OH (UP PAPT.QO.^' 

Ilei'u the fir&t field in the UP/POhIT list is the major 
field, tb-' next is the minor field and on. 

3*5 Oo mpl oto Qu^'-HZ * 

After having defined the condition part and tho action 
part of a query, tho syntax of tie complete ouery is as follows 
Query name (¥or]c space name) 

COHDITIOH ; condition part 
ACTION ■; action pa^'^tj 

As an ill ustr-^tion th.. example Query 7 would bo 
written as - 

QUERY (VJl) 

COh DI TIOIT : SUPPLIER. S # <> SUPPI T . S 
ACTION ; GET SUPPLIER. S# , SUPPLIER. SEAMS j 

In ease of a, multi-step querj'', the abovo three lines 

I 

a.rc repeated again except that tho semicolon (j)> indicating 
the end of tho querj!-, comes only at the end of tho lo,st 
sub query. 

Tho complete syntax di.agrams for tho language just 
defined arc given in Figure 5.2. 









FIG. 3.2; SYNTAX DIAGRAMS OF LANGUAGE. 
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3.6 Compl ctanes s of t lio • 

To allow that our languor c la reasonably coniploto, wo 
will compare its featuros with the infer '•aticii r ;trieval 
foatures of DSL ALPHA. For orch of thee fo'^turos of 
DSL A^LPHA (roferenco ['ij)s vr.: iLow tlr^t there exists a corres- 
ponding fu'turc in our languo,go which performs the samo tash 
as tho one in DSL ALPHA. 

(1) Tho concepts of 'operation nn,mG' (G-DT) end ’ worksp.aco ’ ( W) 
hove boon borrowed directly from DSL ALPHA. Tho user can na.mc 
his own vp'-rkspaco and tho 'na'^e inheritance rules' of his 
workspace, as applicable in DSL ALPHA, are valid in our laii- 
guagi also (Refer to example Ouory 10 in Chapter lY ). 

(2) Corn spending to tho ' I'argot list' of DSL ALPHA we have 
tho 'action part’ in our langungo. Tho ta.rget list in DSL ALPIL? 
contains the names of the attributes (oualified by the appro- 
priate rol'tion name) and so does the action rjart in our 
language. Tho target list can contain just a relation name, 

■with no attributes specified, in case tho whole of the relation 
hauS to bo brought to the workspace. This feature is not pre- 
sent in our language. This does not limit tho scope of our 
language, howevor, because the same purpose can bo achieved by 
giving tho complete list of attributes in that relation. (The 
information about the different attributes in co,ch of the rela- 
tions can bo obtained from the 'R3LDSF' file). Some functions 
which are allowed in tho target list of DSL AJPHA arc also 
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T,ll 0 ¥ 0 (? in onr Inn'sung'.. Exannlos arj 'COuTEi" and 'TOTAT,'. 
ihc syntax for uho uso of tiioco lias '^Iro-dy ts.n described, 

T'ho functions 'yiklV and ’ I'm:! ' i-rvo not boon included in t.hc 
action pai^t but tlio purpose esn b'"' achievid by making a multi- 
stop query in our language. lor mnplo consider the follow- 
ing query in DSl ALPHA - 
PANPE Z SlJPPiy 
GET ¥ IJAX(Z.OTY) : (Z.G# =?) 

This caji be converted to the follovring in our language - 
0UBkY( VI ) 

CONB I TIOIT ; SUPPLY. S # = ” 2 " 

AUITOn: GET SUPPLY. QTY 
OUERY(W) 

COFDTTION; VI. QTY = 1 TOP 
ACTION; GET Vll.^TYj 

(3) Correspc riding to the ' ciu.'TLif icetion ' in DSL ALPHA wo 
h'^vo the 'condition part' in our longupgs. There is a major , 
differoncr, however, that the condition part is much loss 
procedural than the qualification. A simple condition 
in the condition part is entirely independent of what h?„s to 
come in the action part. Vhor'?as the qualification (being 
a qualifier of the target list) depends upon the target list. 
As an ozamplG consider the following quorp - 

Find the supplier nahies of those suppliers who supply 
the part with part number 3. 
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Th.e qualification part of the DSu ALPHA, shot-siont for this 
query bocomco 

SUPPI,IEE S 
PALGL SUPPLY Z 

GUT ’/ 3. spams ; (5.3 ^ ) A 1,2 , P^^ ,=_3j 

In our language the conJition pau't will he - 
QIT3PY (W) 

C QtTLI TIOP; SUPPLY. S tf = ^" 3 \[ 

ACTIOP; MT STOPLIER.SPAT3; 

(4) The element ordering feature in LSI ALPHA is also 
includt'd in our language. This orpresoion follows the 
qualification in LSL ALPHA whoro.as it folloiro the a.ction part 
of our language. This element ordering exprocsioD could he 
as gnnoro,! as in LSL ALPHA. (Refer to table ACTTA'^ doscrihod 
in Chapter lY). 

(5) The facility of specifying tho 'quota' in PSL ALPHA is 
also included in our language. This is done by specifying 
the number of tuples to b ? rctri'^v^d 'uithin brachetc after 
tho reserved word GUI in the iction part of our language, ( Refc 
to tho table ACTTA? described in Chapter IV). 

(6) The negation sign (“^) used in the qualification in 
DSL ALPHA, though not allowed in oui language, yet the purpose 
can be achieved by using 'not equal to’ sign (<>) and the 
boolean operators 'AHD' and 'OR' in the condition part. As a.n 
example consider the following query - 
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tho supplior numbj"':! of supplier,? vl'ic do not supply 
Part nui tor 3 (and may nob supply any pa]t at all also). 
The DSL ALPT-IA ouery for this becomes — 

M-MI. OUfrilER S 
DAl TlL SUPPLY Z ALL 

01® ¥ S . S ; ( ( S . S # ) A ( Z .P # =3 )') 

Tho query in our language for this bocoaias - 
QUERY (¥) 

COITDITIOrT ; SUPPLY . S # <> " 3 ’ ' 0 SIJPPII S # <> SUPPLY . S # 

Acragus CtEt supplier. S# 5 

(7) Punctions allowed in tho qualification in DSL ALPHA 
are also allowed in our lsngua.go. Those functions are 'TOP’ 
and 'BOTTOir . Tho synta.x for the use- of those in our language 
has already been described. 

(8) Tho 'PAFGB' sta''’enontr. in DSL AiPIiA> provided for tho 
purpose of abbreviation of relation names j are not allowed in 
out language. Thev just serve, ■’rhe purpose of avoiding the 
repetition of long relation nataes in the query statement, and 
are not functional. In as far as they pIso servo tho purpose 
of defining tho user's data sub-model, wo can provide in our 
language the ' EinYPOEIiEUT ' statement (Pefor to Ch'-ptcr VI). In 
DSL ALPHA tho r angc statG;acntc ^Isa serv-: the purpose of 
allowing tho shift of the universal quantific'*' from the Qua- 
lification to theso statemcntc. This footure has bo^-'n incor- 
porated in OU-: language by a special condition type (CMDCD = 3 
(refer Chapter IV) which specifics that all the attribute 
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vn.lut-’3 of on attribute in a relation bi. trosted as one entity. 
For v:xomplo coroi'i.'r tli. .Collo-j-iny ouury - 

Got the supplier m mb vs of c.upplioio who supply to all 
jobs. 

Tho DSI, AIiPliA. ouf'ry for this i com s -■ 

SUPPLY Z 
M':9I JO 3 J ALL 
CLlj? w z.S : (Z. J# = J. J O 
The corrcGponclins: query in our language becomes ~ 

CUZKY (U) 

COITDITIOU ; JOB J# = ALL 
ACTI^U : SUPPLY. S# j 

Thus wo Can say that ou" language incorporates all the 
fo'iturcs of DSL ALPHA as doscribod in rofcroncc [1]. 



CHAPTER IV 


SOURCE QUERY TO OBJECT QUERY 

4 • 1 Source Quer-r and Ob.iect Query ^ 

After Raving designed th.e query language, we have to 
build a system that accepts queries in this language and 
executes them. We call, for notation sake, a query in our 
new language as the 'Source Query' and that converted to 
one having relational algebra operations as 'Object Query'. 

Like any other language translator, the first phase of this 
system analyses the string of characters in the source querj?- 
so as to form meaningful S 5 rmbols. This phase is called 
'lexical analysis'. Outcome of this phase is a string of 
meaningful primitives, called lexemes (like 'GET', 'ALL' 

etc. in our language). The next phase of the system has to 
combine such lexemes, collected from the input srring of 
characters, and recognize grammatical phrases in the source 
query. This is called 'syntax analysis', finally, the system 
has to discover how the source query does what it is supposed 
to do. This is the 'semantic analysis' phase. It is required 
to build certain tables which store information contained 
in the source query (about the intent and the names of relatior 
and attributes involved), from the information stored in thesf 
tables and that already existing in the system (about various 
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relations and their attributes present in the data base)> 

the object query is construictoi . The object 'mery hs-S 
statements like 

JOI’-T R1 LlU'j £2 01 Alj 

or PROJECT R1 BY A1 A2; 

or SORT El OY ASCEMDIiYG- A1 DESCBNPIITG AOj etc. 

Here El, P2 are relation naci'-'s, either original or those 
constructed by the user for storing intermediate results, 
and Al, A2 are attribute names. 

4 . 2 lexical An a lysis ; 

The ta.sk of the lexical analyser is to recognise in 
the input string, "which is the source query, items like 
names, numbers and separators. This tasl- is done by pro- 
cedure GETYPYTSYM. hhene"vcr the syntax analyser makes a 
call to this procedure, the next symbol (lexeme) in the 
input string is passed on to the calling procedure through 
a global loanable ' SYM' . ’G-ETCH' is a standard procedure 
which reads the input text and passes on to the calling 
procedure ( G-STHEXTSY]'! in this case) the next character in 
the string. The global varia.ble for a characti-r is 'OH'. 

lIOEVh the number of reserved words in the language, 
is a constant and is set to 14 in the beginning of the 
program. These reserved word.s are ACTION, All, AND, BOTTOM, 
fOlMDITION, COUNT, DIEEEHEFCE, DOM, GET, OR, SAME, TOP, TOTi 
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and TJP. There aro 11 deliiaiiers. Tbc- ar'- -’i' (colon), 

' ( ' (iparon) , ' ) ' (rp.ar'sn'' , ' . ' (p Tiod) , ' , ' (cenma) , ' 5 ' (se^ai — 
colon) , '=’(oql), '<'(lsc), '>’( gtr } , > >' ' ( aspst ) and ' * ' ( star) 
Three other do Li ’niters aro fonned by a co ib."! notion of these. 

They arc - '<>' (nc-q), '<='(l^'q) and ‘>= ( 300 ). The reserved 
x/ords and delimiters are initialized by procediire IhlTIllIZE 
(Pig. 4.1). 


The identifiers in the language could bo upto ’Al’ 
characters in length. (AL is a constant and is set to 10 in the 
program). As usual, tho identifiers can start rith any of the 
alphabets ('A' to ’S') and can contain alphabets, digits and 
characters ' # ' and . Thes 3 last two cbaractors have been 
included because attribute names in a dat.a bas ) can contain 
such characters, (e.g. ’MG-R-P ^ ’ )• 


I'Tumbers could be integers or fleeting point reals. A 
negative number can be- represontod b;^ a sign before the 

value, but a non-nogative number should not have e, ’ + ’ sign 
before it. 


The listing of procedures GETMEITSYK and GETCH are 
given in figure 4.2. 

4 • 3 Syntax Analysis ^. 

The syntax of tho sourcoquery is checked in accordance 
x^ith the syntax diagrams of the language described in Chapter III 
It is almost a. 'one look-ahead' parsing scheme. Provision has 
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boon mode to indicate to the user prociro error messages in 
case there are errors in the structure nf idio query. Two 
procedures, ERiiOii and h'AP5 have been Britton Tor the purpose, 

A listing of those is given in figure 4.3* In case of errors 
the sour'^e query is checked a& far a-S poooiblc for any furthc 
errors uid th’' user is indicated of the same. The system does 
not produce the object query in case of a-'n'cactical errors 
in source querju 

A procedure namied OILICJTQUERY is C'-^lled by tho main 
program to chock the syntax of tho comnleto source query. 

It also builds up a number of tables required for producing 
the object query a,t r later stage. This procedure in turn 
calls two procedures COTTIEX? (to process the condition part) 
and ACT (to process the action part). CCJlDfAC is a procedure 
used by CONDEXP to check the syntax of oach simple condition 
in the condition part. listings of COTIdl? and ACT are given 
in Figures 4.4 and 4.3 respectively, listing of CRDCKQUERY 
is given in Figure 4.6. Row wo describe rho various tables 
the system has and builds during hnc process of syntax analyst 

4 . 4 Tables Maintained ( Diroc b orios) : 

For a given data base information about tho different 
relations and attributes of tho system is kept stored in tho 
form of tables of records. Thos' tables are referred to very 
frequently during the process of running a query. ¥o will 
describe these in dy:tail. 
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benin C.-JjaCw+U 
AOuGAHM 

GPI'‘.GAPAVAT.J»CN? 
GPl^»GA^'A5yM;«INTNUM8ERl 
with ro»^TABrCWl do 
benln 

it SyH«CQUNTSYM then 
beciin C?<DCP:*9? 

AtTNUCssO? 

GP'TUEXTSYMj 

if SYMsLPARFN then 
begin GETftFXTSYR? 
if SYMsTDEnT then 
begin GETRELNOtRELNO, Ih) r 
1# FEI.NO>fi then 
heoin Pei.Nf»Cs®PELNOit 
ttltflEXTSyMj 
if BywaSAMEsyR then 
begin NEW(Pi)j 
$AtYS9l«FS? 
sreoeti GeWxtsYMj 


then 


H 


|Ms 

J? 1 1 ^ 
Pli^;AftNOt 
CETHEXTSyW 

■^8t, EBl^OiRCl) 


Hf then_ . 
At?NOCATf«0,ID)? 
0>0 then 

HB^TATTlPFIf. 

A'ETNOi^jsAfTMOi 


' ^ f 1 

efSe EftRoilU 

until SYMtC.PHMAJ 
Pi l'‘,MEXTftYTs*nii ' 

. endj - - - 

if sywssniFFRREHTSva then 
begin wgw(n)i 
OSTTS^Jspi f 
repeAt SEWXTSYM? 

if AYMsIGfJjT then ^ ^ 

begin (;m|TNpCATW.|i>)l 
If AT'fNn>0 then 
Jin P|t;»Pl; 
iW(Pi)i 

•";HiEXrATTtaPl;- 
. ^ , ''♦ATTHObesATTSO; 
GEIHEXTSyw 

end 

else ERKOPtl) 

f nd 

Se EKROgCiJ 
until SXfSfCQ«MAJ 


f>ii 

endi 


.WfiXlATTsanil 




If then 

begin CHlTNEXTsyM; 

5?yv In tE0t*,IiSS,GTR,Ni50,LE0,GE03 
bealn COHOsaSXM; 

GFTNEXTSYMj 

If ’^YWalHTKUMBEP then 
beam VA!.9S=TNTN!I?4| 
if VaTj 9<0 then ERROR(?)j 
GF‘r«f:xTsyM 
end 

eJse ERROR (2) 
end 

else FKPORO) 
end 

else FRRURC4) 
end 

else EHR 0 R( 6 ) 
end 

else KrpoR( 6J 
end 

else FRROR(7) 
end 
else 

If syMsTOTAtSYM then 
begin CNr»cns»Sf 
AfTHOClsP&J 
GEWpSYW# ■ 

If then 

begin Gi^rWEXplENi 
if SYFwiDEWf then 
btfln CETfi5UMOCRE|,HO,IO>l 
if RElSO>0 |hen . 
begin ifefiLSOCt»RBLI«0? 

if SyMsSARhSYM theft 
begin v{5w(Pl)? 


sAftsSie^l# 
reoeet GETkmva? 

U syMaioFST then 
begin gETA'PTWQCATtWO.JlDj 

If ATTMOO then 

begin PilisPif 
aEwCPi)? 

PU*4«SXttTT:BPt; . 

GBWEXTSYM 

end 

else ERRORil) 
end 

else ERROR CU 
until SYafCQHXA; 
Rll'".flRXTATTs;*nil 
endj 


then 



if Sif^’sCUliQN then 
Heoin netMEXTSYMi 
it SYMajuENT then 
beoin GE!rATT«0(ATTW0,ID)f 
if ftTTp|0>0 then 
begin TAW:®ATr«0? 
gftnkxtsymi 

If SYMsRpAREN then 
begin GsiTNEXTSYMi 

If SYM In tE'Ul»,hSS,GTR,NEO,I.BO,GeQ3 
begin condssSYM,* 

GBfWgXTSYMr 
if SYWsththumrer then 
begin T0ID8ssI«TNUMBKRy 
VALSlSeTSTNUMj 
GETNEXTSYM 
end 
else 

if SYMsfteALNUMBEH then 
bealn T 0108 }sRBAtNUMhER? 
VAL8Rl«REAW«; 

GEtNEXfSYM 

end 

else FHRORCt?) 
end 

else ERROR (3) 
end 

else ERR0RC4) 
end 

else mmmx} 
end 

else ERR0R<I3 
end 

else ERR0Ra23 
end 

else ERR0RC61 
end 

else ERROR ($) 

else ERROR (7 i 
end 

^ ^L|is«ys?fei.w,mr • •■•, . ■■ 

l^'Rt^So>d then •' - ■ ’ ‘ 

begin REhM0CS»Reh«0| 

if 8Y«aR£Rl00^then 
begin GEtREXtSYe? 

I# SYMwIDCMT then 
beam GEtAtTMCiCAniiOrSOir- 
if ATX«0>0 then 
begin ATTN0C:«AT5f80f 
GETWgXTSYJ*! 

if 8YM in £EQt,gE9,G5S,l-eO,<!fR.SE03 
begin cawojRsifM? 
aer«£X'f$¥Mi 
U SYMsAGtSYe then 
begin 

If COW&tKOi then ERROR Ci 8); 
GETMEXYSYM 
end 




e 1 se 

If SSfMssStSR then 
henin c«DCn:B2? 

1€ COWDSjSOIj then ERRORCtg)? 
GETNEXTRYM 
end 
else 

If SXMrIDENT then 
beqin GETRKLMUCRELNO, ir>) J 
If RELMOO then 
begin CMDCDs*?? 

HFri^NOTjsRELWO? 

GBi'^exrsyM? 
if sywaPERlOD then 
begin GETNEXTSY^^J 
if SYMaiOENT then 
begin GgTA.TTNO(ATTNO,Ih) f 
if &TTNOO then ■ 
begin ATTWP7 :*ATTNnj 
if COHDftNEO then ERRORCS}? 
if ATT!I07#AT’TN0C then 
ERRORCD? 

GETNEXTSYM 

end 

else ERROR(l) 
end 

else ERRORCl) 
end 

else ERRORiS) 
end- 

else ERROR (6) 
fn4 

WalStJiORSiER then 
ft YA.l.45»IiiTHVM? 

REXXSYMf 

if SYMetORSYM then 
begin CnDCDsadx 
GETWEXTSYM 
eftd 




end 

elj. 

b<! 


HeBOtfftRSI 
n CNbCD:®? 
SEX1’SY» 

ERROfttR) 


then 


SY«aAi>Si5 then 
igln CWhCDfat? 
frrNEXTSYMl 

■ r lOSST. XKTWUR&ER , RRAI.»Oh»EStHften 
begin TQ|Di:ssSTMf 
case- TOTOt of. 

1DSHX:VAI,UJ ^ 

IHXNOMBERJVALI; 
end?- 





» ' 't' ^t>e* 

” j- ‘j>'s ' * ’ . , ' J. 



GEXNEXTS^M? 
if SYMsAPST then 
GEl'MEXfSYM 
else ERR0RC19) 
end 

else ERR0R(16) 

6nd 

else RRPOR(tO) 

end 

else RRRORC3) 
end 

else ERPOR(l) 
end 

else ERRURCl) 
end 

else ERRU*>(8) 
end 

else ERROR(6) 
end 


end 

end 

else 

if SY*>«*f.PAKEN then 
begin ADhGAMA? 

G^>1 , lAMASfM I ^hPhmn t 

GEtKtXTSYMy 

CnHhEX?? 

If Si^wPt'ftREH then 

beoin AUhGA*«Af 
GPi . GA>«ASyM : «RPAREI^? 
GKThEXTSYM 
' end 

else KRRUPi4> 
end 

else RRRCIR£1Q) 
endl 

begin CONOFACi 
wnlle dip 

begin AiioGAwAi * ■ 

QPl** 

G^'TNsix'rsyMi ■ ' " ■ 

COWDFAC 
end 
end? 

“SS}5,"®SS^i«oo . ■■ 

.^AdASy*S«ORSXfil! 

G'^fFEXlsyM? 

CnhhXERM 

end - • 

$ f r ^ ~ . 
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iroce re ATT * 


var 

I , J. »< : lnt«a<Rr? 

Kf i flj boolean } 

rrocfautf' TNStcoTCvar Islnteqer)? 
var 

vi; IntPger? 
f'l.AGjhooiean; 

Hentin J?sll 
T.AG! Kf aise? 
r^ueat 

if TAPATTfJl ,ATTNnr=ATTNO then FLAGi=truef 
J:=J+1 

until (FLAG or (TAbATTt.TJ .ATWOTsO)) 

Uth TAWATTm rto 


nectin 

If FLAG then REtNnTt«0 

0 1 S ^ 

begin PEl.N01‘i«RgL<40f 
ATr40T8«AtfM0l 
ACtCDStsOf 
ISet+1 
end 
end 
»‘nd ? 

betjtn T:aii 
if S1(«»GETSYH then 
bealn GetHEXTSYM? 
i# SXMsi.pAhEK then 
begin r.ETNFXTSt^st' 
if SYHasCutjjNtSYM. then 
begin ACXTAS*CC!«»tRE0t*tr(J«7 
GEt«eXTSYM 

end , , . , 

else '■ '' u' 

If sywnsTHTNyMto then' 
beoin ACTtA&.liThStUsTdT^yill 
ArTTA'f^,tMTREg*atr«ef 
GET^JEXTSyM 

if tijn - S£TNSXtS1f« 
else EBROR(4) 
endf 


ftf w**>» w<« •« «!e> 

J . * f , ' , - . - - , 


'f' 





f Si^sTDF^T i-hen 

beam Gr:'^'RFL^'0(RFLN0,i^) j 

if Rc.T,vn >0 tr>#‘n 
bfe^ln TeMpsiMjalOEMT; 

1 ^i^^TTlT] .aCTCD:»1 ; 
rtF'fMfcXTRYMf 
if RY^-sPERiotj t^en 
bectln rETKFXTSYi^i 
if RYKsTPFhT fhen 
beqin GKl’A'fWO(ATTNfj,in)j 
if AiTN0>0 then 

beam TAPAT'i-ri] . ATTNOTiaA fTNO; 

J :«T+i ? 
tiFtWRX'fSYWf 
while SYMaCbHfSA do 
beoln flF.TwFXTSlfM? 

If S¥M*idfnt then 

beaib GETRELNOfREIiND, ID) ? 

If, R|I.NO>0 then 

beg|.n TABAff CD .RCLNOfisRELMOf 
Git«EXTSyMf 
i# SYFaRgRinu then 
beqin GKtNFXTSYMf 
if sii»fslDFrtT then 
beoin GETaTTNOCAWODD)? 
if RTT,<n>o then , 
besin jABAI’fllJ.AfTMOfmATTHOl 
llBAfrCti.ACfCOmtJ 

* f «»T#i f 

«n<l 

*lf9 ERRO»<D 
#«« • ' : 
else ERRORCD 
end 

«|i4 ERRORC83 
en® < ' - ' ' 

«ise #|IR0RC6) 
eno » , ' 

end 

end - • - ' 

else ERRomtl “ 
end - ' 

else EERQBCt) 
end - • 

else EftRORtB) 
end ■ ' - 

else 

endi - ■ . ' 


:i2«SLt^:^^»4SS. "■ /'''xv. 

e» ■■ ee erifSwW ■•►leirwii'^HPPji^eprwtie,^"^ ^ > 

” "* .ii *' t”" *' 1* "f - ' 



47 


if r>yfi=-TuTi)kT,sY.: then 
benln f^KTf'FX’^SyHl 
(tfTTjiP.TTi KFgsstrney 
if rY‘5=T<FAHFfj then 
heqin CtTiSFXtsYMt 
if sxMsIbfw'T’ fnen 

henin Cf thfliHX RFL^O , Ih) ? 
if Fef,rin>o then 
beam ACTTA8.TREtj;=f<Fl.»J0r 

wEThKYi’Sifn; 

If ?/h=SAME5iM then 
hehln ME^CPD? 

ACTTAR,?iATTS:aPt ? 
repeat CfelTNEXTSYaj 
tf 5?iMsTDFNT then 
beoln CETATTKO(ATTHO.in)j 
if A7 'TmO> 0 then 
beoln Plljapt? 

JNfSERtCT)? 

NFWfPD? 

pll'‘*K’eXTATTj»Plf. 
pi I'^.RTTj'iOl.SsATTNn; 
GFT*}EXfSYM 
enrt 

else FRRORCl) 
end 

else FRRORCl) 

Pjl-lwIxi’AXtVanil 

end? 

if SY«acaLOM then 
bealn GETfJFXTSYW? 

^UMKrSB^Swo.iD), 

'LSirKTSgrr.ITS.ATTHD, 

TAbAtnt] .H£hHOT:s:R||«|l 
TAtiAT'l'nJ ^ATVWU'l'iaAttllOl 
•r»H&TXn3 ,ACTCt}taJ t 


ciFTVExxsy:^^ 
else FRRQRC4) 

end 

else fRRORd) 

^e|S 2 ERRORtU 

else r.HRORCt2) 
end 

else FRRtllRiS) 

else ERRORC^^ 
end , 

else ERRORCt) 

end 


then 






IS 


If then ERROR{lyy? 

■i'-'. ts'iui ; 
if syMsi pahfn then 
heuin AC'S’TAa.SR fREOjatrue; 

.vh.'fiiPI ) ; 

Ai" ! rAP,PsnRT:=!SPi? 

rnteftt r.KTNrxnrsYMj 

if f^YM In tHPSYM,no«NSYM3 then 
heain 

if SYHxlIRSyM then ‘fPIMPSORTjaUP 
Pise TfcMPSOP’iSanOWN? 

UFi'lieXTRYMf 
If SY^sTUF'-iT then 
hecfin CKTHFLhO(RF|.NU,ID)» 

If Rfc’LNOh tnen 
henin SPtl:»SPl; 

.’4FW(SPi)| 

iJPlt**,SnRnePsaTEf«P§ORTi 
SPl! '‘,riFXTSnRTj*SPU 
GEi:»«EXtSYf^| 

If symsperxoo theh 

beoin CETNFXtSYMi 
if SYFsIDFNr then 
htoln rK7ATlS0(ATT*l0rID^| 


if AfT«n>0 then . 
teeoln $Pll^.ATTNOS:aAnilO| 

'fJEXfSYJ 


GF 

©ise ERROR Ci 3 
end 

else ERRORS ) 
end 

else ERROR t« I 

I nd 

se E«R0E(6) 

end 

else ERROR (6) 

else ERROR a »> 
ant 1 1 SYpcoRRAf 
SPiJ*.r.EXTS0Rl*8»ftllf' 

If SYHsRpAREJi then GETcJEXTSYK 
else ERROR (4) 
end 

else ERRORCli) 

■nd| 
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ore'^nrc ( 

procf'dur^ 
r'H'’CP'ulihF’ »CT,* 
beoin 

jf Pi '■sf'fi'inTrTOMsym t^en 
becfin (^L’trTXTiiYh? 
if Pj *=C(!T.u*' then 
heqiP ntTNKXTsYivi? 

G?’! !=nn ; 


SfK^ 

else EHPuhifi'* 
en'^ 

els© P:«t^uR(} 4 )i 

If 5Y'^»«^CTinNSi^< then 

beam ynlle SYV#ACTXnp!SYH do GFTNEXTSYMJ 

ERHOkf^O) 

end? 

If SY*^=C(3L0r? then 
begin Gbti'.EXTsyri? 

aC'{ 

end 

else ERRuJl(12) 
end? 

^!f SY^alUEkT then GFfNEKJSYM* 

If SY'^atiRAKF^ then, 
begin GETf^KlTSYh? 

If SYi^alUENT then 
bealn XSPACEsalO? 

GEI'WEXTSYm; 

If symbr^aken then 
benln GFt«FXTSYi^f 
btOCk 
end 

else SKllOP(4) 
end 

else ER»0«(t5) 

else FRhORlYj 
end? 


FIGURE 4.* 9R0i€:‘«nW 

(«en*OTeijnei4i«e|k^ei» *• W«p 
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4.4.1 piroctorA'' o f AttriTjutos in tho Sy^toni ’ 

This is nn-Cied AT'T’DIIl. Tt is oJi oi dimension 

POATTS, tho nurrhor of attributes in tho systeo. Each element 

of tho array is - record of the type ATTFIEUTE. This record 
has tvro fields 

A.TTNAMEP ; for the namo of tho attribute j 
and TYPEATT ; for the typ^ of thv; field, i.e., alphanumeric 

(IDEYT), integer (IFMIDIPL'P) or real 
(EEALl'TUMBEP) . 

HOATTS, the number of attributes in the data base, 
is declarod as a constant in tbe main program. Per our example 
System it is set equal to 12, It has to bo cb.anged for a 
different data base. Tho sprocifico-tions of tho attributes arc 
road in from a file named 'ATTIEF' by o procedure DBFIITEATTS. 

A listing of this procedure is given in Eig. ■1.7. Bach lino 
of the file ATTIEE should define one attribute. The details 
should be given in the folloA'ing order ~ attribute name and 
the type of the attribute ('I' for integer, 'E' for real 
and 'A' for alphanumeric). There should be POATTS attri- 
butes in the file. The order in which these a,re given is 
immaterial. The system stores these attributes in ATT DIR in 
the same order, l/honovor nn attribute nnme is encountered 
in the source querv, a procedure DETATTP'O gets the sequence 
number of the attribute (ATTNO) from the table ATTDIR and 
uses this number for storing in the runtime tables. A listing 
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l^roce'^'jre i-FF T,\fcV,TTt?; 

VdT 

f : intP tei ,* 

fpjcr.ar J 
oeim 

ps’rSP'Tf’F t • i^rri'T.r* it 
for r: = ! to 'hIATTS do 
■'.r't'oihfn do 
Hp-rm '';i',TtAf',F:tTu)i 


•h t. 


*•’. ’•'ir.Tp 

jf,. r 1 I ^ 




, ’.ivHCHAOy). 
if Tp in tnt^n 

ca-be Tf- of 

:TYPFATT»sTf4''’fJl4!BEI>| 

* P ' : 'i Y pp" A T 1 1 ao t: ft U" aER ? 

* A * • I Ypp;A'rTs*TDP’AT 
t;nu 

i*1 S^* 

ufiTT^j.W( Error tn ATtner 

rif !.T 
efid 

<»nd 


ATTNAMEDJAD? 


an' 


oroceidra fUi'^ATnA'uCvar ?»?inteqerHT}j Al,rA)l 
var 

X ! Intag^r • 

benin Jsssl? 

if ( ri<r.-1ArxR)orC(IitWAfTS)and<Atfr>IR£I3 .AnHAMEDs|0>3 3 th«n 
else 
end? 

oroceddre nESCATTS? 
var 

irJf Aslntegreri 

l^tkhWht 

begin ftESfc;TCf , 'TABl-ESn# 


tor i5»t to Nfi&TTS do 




if ATTOIRfll ,T¥OKATt#II»TOFtSfR then 
heoln g£T4AMS!f 

* " '“TWAMEO theft 


1111 !;^- 




If TO<fATtDJ 
beam 

H AijiT 

end , , ^ . 

else '* ' • ' ’ ' 

begin PEA»<r,a)l 
tor K.i*l to ♦J_4o , 
beaift . V-' 

TA&tKfX1tK5t«fu ‘ ", 

end ■ - 

r end ' ' 

end 

'* .end > ■ 

end! ‘ - 

^unct ion sCAi^CMCMi iftteaerftni A^rA)tfnt«geri 

var . 

Xunteger? 

Fl-\^:boQi#sni - . ' , 

begi'' Ii«it ' ‘ v!-u ' 

ri.AG«»leis«i - ' ,. 

while notCrtA^ ©E fmtil de^ 

, ^fl^TOeTAJif.s£«1 IJ3 then fliAGr«trbe 
' •• else rs»i+i- . 
end? - '/ 

. it FtAG then ssAaCHjet 
else SHBOHClb), 

-end? . .- 




•risswie ' f 



52 


of procGdu.ro G-'.'.'TATTITO is given in Pi%"ur‘j 4.7. A number zero 
is returned by uho procedure in case th.o identifi>-r In the 
sourcG quor.y, which is supposed to bo an attribute name (as 
per the synta'-e cf the language), does net match with any of 
rhe attribute- names stored dn the directoiy. An error message 
is printed out in the case. 

For each attribute which is net of the type intogcr, 
we road In a table which associates a unique integer number 
with each different type of value for th'^t attribute. These 
tables arc read in from a file 'TABLES’ by a proceduro 
nBABATTS, A listing of this procedure is given in Pig. 4.7. 

The file 'TABLiilS’ should contain the name of the attribute, 
the number of different values of the attribute and the 
actual valU'.-s of the attributes for each of the attributes 
which is not of the typo integer. 

4.4.2 Diroctoi. of Bclations 3torod in the System s 

This is named fiELDIR. It is an array of dimension 
}TOREL,SP. ROREIS is the number of relatione originally stored 
in the system. 

NORELSP = NORBLS + 20. 

This number is chosen such that we can store all the inter- 
mediate rolatioris built by the user, nach element of the array 
REIDIR is a record of the typo PL'RATIOR. This record has six 


fields 
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and 


riEMAr4ED ; 
ATTD ; 


ICSYD 


PRSTPL 


RTPLS 


PELPL 


for tile najno of the relation 5 
for the atbributes in the relation. This is 
an array of integers (ono integer for each 
attribute) of dimension KTO^TTS (to take care 
of tho exti'Gmo c'^ec ’'^'licn tioro is a relation 
in the system which contains all the attri- 
butes in tho system). Those integers are 
the same a,c the seauonco numbers of the 
attributovS in ATT DU";. Order is maintained 
and a 'ZERO’ means no more attributes beyond 
that index; 

for the key fields of the rela.tion. This is 
also an array of the same type as ATTD; 
is a pointer to tho first tuple in the 
relation. (Each tuple is a record of NOATTS 
fields. In addition it has a MRKing facility, 
used at the timu of implementing the alge- 
braic oporotions, and a pointer to the next 
tuple in tho relation.)? 
for storing tho numb..-r of tuples in the 
relation ; 

for storing tho name of the file in which 
the relation is stored. 


NOJiriS^ the number of relations defined in the data base 
and HOEEIiSP, the total number of relations permitted in the 
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system nxo n.lso declared as constants in tlie main program. For 
our example system tlioy arc set equal to 4 and 24 respectively, 
Th so have to bo changed for a different data base. The ^eci- 
fications of the r'‘lations roac in from a file 'RULDyF' 
by a pTOCoduro D'^’FIEu??LS. A listing of this procedure is 
given in Pig. 4.8, Hach relation should be dofinod in this 
file. The details should be given in the following order - 
relation name, the number of attributes in the relation, the 
nanos of the attributes in the rela-tion, the number of fields 
which together form the key to the tuples in the relation, 
the -ttributr names which form the kev, the number of tuples 
stored in the relation and finally tho nemo of ^he file which 
contains tho relation. There should be d -'finitions of ITORFIS 
relations in this file. As in the c-so of ATTDIF, the order 
of ro3 itions in this file is immaterial. The system stores 
these relatiOiis in the same order in RSLDIR. ^’’honover a relation 
namo is encountered in tho source query, a procedure G-ETREINO 
gets the soouence number of tho relation (EEL’iO) from EELDIE. 
and uses this number for storing in the runtime tables. A 
listing of procedure &ST5S1F0 is given in Fig. 4»8. An illegal 
relation na.me in the source q uery causes an error message to 
be printed out for the user. 

Bach of the relations in the ^stem is stored in a file 
each. The names of these files is read from the file ’EBIDEF’ 
and stored in directory EELDIR as described above. The relations 



orj’f’'.re uP’t* 7 

Vdr 

j ; Int^aer ? 

fc 1 1 n 7^ f SLT ( F , ' hri.r'tr ' i ; 
for J:»1 n* MiHFijK do 
* t Cn 1 1 f i J do 

-•,e-»in v,f:T!MA'-’ECir>) j 
FFi/.H = 

tor <£ = 1 ♦■o u do 

oo'^in «:c;TiMAMFPf IP) • 

4atfeTTkP,TDh 
if Ai''’'r«n5Bd fhen 

^HXTFLM*#* Flrror In RKLDEF »' ,RFI>MANiEDtAl,>? 

n LT 
eod; 

AT-l'OlKJ taAXTj^iO 
<»n 1? 

tor h:a.Ul to '-vnATfS do AfTD£K)!»0? 

F«^A*nF,d): 
tor Kjst to d do 
bedin HiTf'tAHFOJt IP) r 
GFl.MT-Mn(ATTA0,lD5; 

If A'lT^iOsd then 

beain Frror In relhef -Srel«aM£D:Al>? 

iiAbT 

eodr 

.•^rxot'ci jaAi’Ts^n 

for KsaJfl to rtHATl'S do KEKOCKJfsOj 
HFAnCF,?FrPLS)l 
r,ETdA«K(in)| 
aE»jFl,;*lp 
And 
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are read by procedure READ^^EJ.y, a listing of -^hich is given 
in Pxg. 4.8. Eo.cIi file contains the values of the attributes 
in the same order in which the attribute names were defined 
in ' RnLDEE ' . The integer values are read and stored in the 
tuples O.S such. The non—intege: va,luc‘e are rend and their 
corresponding number is obtained from the tables, which have 
been read from the file 'TABLE'S' . These numbers are stored 
in the tuples, a procedure SEARCH, a. listing of which is given 
in Pig. 4.7, does the job of getting the number for each non - 
integer value read. 

4.4.3 Commo n Att ri bute and C o mmon Rolnfcio n Tables ; 

Those are the most important tables in the system and 
are used at the time of producing the object query. The basic 
assumption in our system is made use of in these tables. To 
state explicitly, the assumption is that two different rela- 
tions in the system are either related through a common attri- 
bute, or, if there is no common attribute between the two, 
there is atmos fc one other relation (in the user's sub-schema) 
which relates these two relations. In other words, for two 
relations R1 and R2, either there is one and only one attribute 
A1 which is common to both R1 and R2 (tliis is true in case the 
relations are in 3RP) , or there exists atmost one other rela- 
tion R3 such that there is an attribute A1 common to both 
R1 and R3 and there is another attribute A 2 common to both R2 

and R3. 
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In c^se there is no ouch A1 (coniuTon t:o r-’nd 72) and 
thcr'^ is no such 23 (common to 71 o.nd 72) and the source query 
involves tvro such relations, then the object query is not 
produced and the user is asked to break up his query into 
simpler subqueries so iiiat the above icsumntioa becomes true. 
This broakiny up is alwiys -oossible if the query is msaningful. 
It has to bo pointed out hero that in aost of the queries, 
commonly made by a genuine user, such a condition will not 
arise. The provision of giving this moscatTo to the user is 
required so that he may know that either his subschema is 
not comploto or that his query is extremely complicated, 

Bp.sed on this assumption, we build up two tables -COMATT 
and COMREI/ to store all such information. Both these tables 
are two di 'ensional arrays of .size 'MOPBISP X N0P71SP. The 
enteries in these tables are meue according to tbs following 
rules - 

C0’'’IATT [ijj] = 0 if i and q ar"* two relp-tion.s that have 

no attribute in common, 

COiyiATT [i»3] = I* attribute that is common to 

relations i and q. 

C0'''IRP1 [i»D] = 0 there is no relotica in common to 

relations i and o, or OO'^'ATT [isO] ^ 0. 

COBIRBj'j (iy j) = k if relation k is common to relations 

i and j, and COmT fi^j] = 0, 
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Here i,j anc- k are the S‘=^queiac'^ iiUitib«=rs oC the atTrihutee 
and relations in AT'T'DIt' and RkLDIR respective"’ 5^. These tables 
are initialized (for the relitions read in) by the procedure 
ILKIOlTA^o. A listing of this procedure in -iven in Pig. 4. 9. 

In esse of a multi-step ouerv, after the processing of 
each sub-ouory, these tables ore updated to include the final 
relation built by the user in thet sub-qu ry. This is done 
because tha next sub-query is very likely to involve the 
relation built by the previous cub-auery. A procedure MDPCOMTAB 
has been written for the purpose. A listing of this procedure 
is given in '^’'ig. 4.10, 


4 . 5 Tables Q uilt (Run Tim e) ; 

During the processing of any que-^y, information con- 
tained in the source query about the type of the condition part 
and the attributes :,ind relations involved therein, and the 
action required and the attributes and relations involved therein 
has to be stored in some form to enable the object query genera- 
tor to perform its task. Now we describ- thesa tables in details. 


4.5.1 TablPi fon Condition P art: 

This has bean giTCa the n-r.o OOKTAB. It is an array (of 
size 10) of records for the general case of a oomplex oonditaon. 
This record has four fields oonur'n for all types (refer to the 
7 condition types described in chapter III) of conditions and 
different number of more fields each depending on the type of 

the condition. 
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Pirst let us dt-'fine a vo'dable CTOD for the condition 
codes. It is ot the type COFlDCOnF' Yhich can take integer 
values from 1 to 9. Each of these va,lues (except 6 vjhich is 
not used presently - it can he used for adliii^' any future 
■fcypes of conditions) corresponds to one of the condition types. 
Thv four comraon fields in record COVlfAE are - 


REMOC 


ATTfTOC 


COND 

and CEDCD 


for storing the 3'elation number of the relatio 
on the left liand side of the comparison operat 
in the condition^ 

for storing the attribute number of the 
attribute on the- left hsnd side of the com- 
parison operator in the c'''ndition. It stores 
the value 'zero' in cas« CN'DCD = 8 or 9. 

(Por CHDCD refer next pnge)> 

for storing the comparison onerabor involved 

in the condition j 

this stores the typo of the condition. 


The variable fields for different values of .CITDCD are as 
f ollows- 

CNDOD = 1 (relation. attribut'^ <00i\TD> " value-') 

VAL1A\ to store the value within the quotes, dependin 
or \ 

VALII f ; upon whether it is alphanumeric or integer or 
or I 

VAIIR^ real respectively. 

CNDCD = 2 (relation, attribute = 


No special fields. 
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j 

First let us define a vo-uable CFDCD for the condition 
codes. It is ot the type CONDCOE?' rhich can take integer 
values from 1 to 9. Each of these values (except 6 which is 
not used presently - it can be used for ad'’. in^' any future 
types of conditions) corresponds to one of the condition types. 
Th-, four comrQon fields in record COelTAE are - 

R-ULFOC *. for storing the 3'elaticn number of the relation 
on the left Iiand side of the comparison operator 
in the condition^ 

ATTFOC : for storing the attribute number of the 

attribute on the left hsnd side of the com- 
parison operator in the c^'ndition. It stores 
the value 'sero' in cast:? CFDOD = 8 or 9. 

(For CHDCD refer next pnge)? 

COITD ; for storing the comparison onerabor involved 
in the condition} 

and CHDCD : this stores the type of the condition. 

The variable fields for different values of CITDCD are as 
f ollows- 

CNDCD = 1 (relation. attribut'^ <COhD> * value') 

to store the value within the quotes, depending 

OIT \ 

VALII r s upon whether it is alphanumeric or integer or 
or T 

VADIR' real respectively. 

CNDCD = 2 (relation, attribute = *) 

No special fields. 



=. 3 ( relation . attribute = ALL) 

I'''o sp GO ial f i eld s . 

Sll5PP-ilA ( rol'^ o I jn. attribute <C0ND> n TOP) 
"'’'AL' ' for s to ring the v- lue of 'n' . 

ClyTLCD = 5 (rol-tion. attribute <C01'TD>n BOTTOM) 


VAL4 . for storing the value of ’n' . 


Pifper 

CNDCD 


= _6 (not used prscntl 3 ''') 

=._Z ( relation. attribute <> relation. attribute) 

PJlJiWO? ; for storing the relation nunbor of the relation 

on the right hand side of the' comparison operator. 
A'TTN07 : for storing the attribute number of the attri- 
bute on the right hand side of the comparison 
operator. 

CND CD =. 8 ( TOT A.L( relation SAME attributes; at tribute) <COND> value 

pointer to a list of nodes of the type ATTNOD h 
This list contains tho attribute numbers of 
attributes in the SAITB list. 

for storing the attribute number of the attri- 
bute of which the total has to be checked. 


3ATTS8 


TATT8 


YAL8I\ to store the value on thvs right hand side of 

the comparison operator, depending upon whether 

VAL8R7 ^ , 

it is integer or real respectively. 

CNDCD =-- 9 (COUNT (relation SAT'IC attributes DIPFB^ILTT attributes) 

<C0ND> nositive integer) 
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SATTS9 • pointer to a lie c of nodes of t‘^T?e ATTtTODJS. 

TMs list c ■'nto.iiis the attribute numbers of 
attributes jn the list. 

DATTS9 : pointer to the list of attributes in the 
DIFPBFdlNT list. 

'VAL9 ■; stores the non-negative integer value on the 
right hand side of the compai'ison operator. 

4.5.’ os fo r Action Pa rt; 

Two tables aro built for the purpose of storing infor- 
mation in the action part of the source query. One is used for 
storing the numbers of attributes and relatioro that arc reauired 
to nerfoiTi the task asked for in the action part. Other is 
for storin ^ the final action required on the relation to be 
outputted to the user’s work space. We discuss these tables 
in greater detail now. 

Tt_ble of At tr ibutes Required; 

This is nonod as TA3ATT. It is an orras^ of size NOATTS. 
Each element is a record of type ATTSUSBI). It has 5 fields - 

ATTNOT s for storing the attribute number of the attribute 
in tho list of attributes in the action partj 
RELNOT ; for storing tho relation number of the relation 
in whose context that attribute has been usod*, 
and ACTCD ; (ACTion CoDe) for storing the reason for which 

that attribute is used in the action part. This 
stores the value 'zero' if tho attribute is not 
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to be output tod In the final ralation (and 
has to be brcujht t omporari I”' for the purpose 
of sorting- and/or tot-^’’ 3 iiig) . If it is on 
output attribute then this field stores the 
value ' one ■ . 

In case sortin/^ end /or totalling is required before 
finally outputting the relation, the field AoICD is made use 
of to remove those attributes from the final relation which 
have ’toro' in this field. 

Table for Actions Required : 

Tills is named as ACTTAB. It is a r'^xord of the type 
ACTION which has the following fields - 

SRTTJJBC' ; a boolean variable which stores ’true' if 
sorting is required before outputting} 
rSORT ; a pointer to the list of SO^TNODES. This 

node contains the attribute number (ATTNOS) 
and sort order (SORDSf) of that attribute. 

The first attribute in the list is the major 
field and rest, if any, ore subsoouent minor 
fields for sorting. SOED^S scores 'UP' or 
'DOWN' as the case may be? 

TTLRBf ; a boolean variable which stores 'true' if total] 
ing of a certain domain is rcouired} 

TPJDL : for storing the relation number of the relation 
involved in doing the totalling*, 
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" a pointer to tlie list of ottributc numbers 
of attributes in the SAME list} 
i for storing thr number of the attribute of 
which the tot^^llin^?: hro to be donoj 

COlL'.i, boolean variabl . which stores ’true' if count 

of the nujfbor of tuples in the final relation 
is rer, Hired } 

LMTEEn • boolean va lablo which stores ’true' if the 
number of tuples to b'* outputted ir to be 
limited to 0 certain number} 

LIJ4F0 ; for storing the number of tuples to be out- 
putted if LrlTEZir. is true. 

Those two tables, ACTTAf and TA?’ATT are cleored before 
the processing of my query (or sub-ouer]?" in ca.se of a multi- 
ctep ''uory). This is done by the procedure CIBAKTABLSS, a 
listing of which is given in fig. 4.9. 

4 ^ ph j oct c u ery G on era, t i on > 

The runtime tables, that have boon just described, and 
the system's maintained ta,blcs, ihat were described earlier 
in this chapter, are together used to convert the intent of 
the source query to statements in the object query. These 
statements are the usual relational algebra operations like 
project, join, divide etc. 

A different sequence of steps is required for each of 
the different condition codes (CTOCD) stored in the condition 


SATTS 

TATT 
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tT,blij (CONTAB). Tho num'bors of tho rel.ations ijivolvod 

-aro storod in COITA'j and TAT3A'^'T. ¥g donotc- tho rolation 
and tho attribute on the loft hand side of t'le co'^iparison 
npeT'ator in the condition part as RC ■c’nd AC rosp ?ci:ively. 

One idor has to be brought o\^t w'lich is c'’nrri''n to oil 
condition code.s. It is that lARATf contains the nuoibors of all 
relations (along vdth tho attributes) tha.t aro to be brought ^ 
to the work space. Pour different cases are possible : 

(i) A’’! the I'clations in lABAfT ar= sorae as PC, 

(ii) There exist relations in TABATT which "ire different froni 
PC but have an attribute each common to it. (This can 

bo checked from the COtlATT table described earlier), 

(iii) There exist relations xn TABATT which arc different from 
BC, have no common attribute with it, but havo a rela- 
tion each common to it. (This con bo chocked from the 
table CC'^REL described earlier),, or 

(iv) None of th- above three,, i.c-. , tho relations, different 
from EC, in TABATT do rob have oven a relation in common 
with RC. 

V/o will describe, after a while, tho soauence of actions for 
each of tho condition codes for case (i) only. However, the 
system takes ca^i'c of all ca.ses in the following way - 

A procedure CHECK (FLAG,l) chocks the ralo.tions in TAnAT 
to SGO if they are different from EC. If it finds no such rela- 
tion then the value of FEAG- is set to 'false' (case (i)). 
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^jlse j. Lag io sot to ’true' and then 'I' points to that entry 
in TA3ATT which has such a ’-ul-tion. A proceduro DUIIL is 
invoked if FLAG is found to b ■ true. This procedure joins to 
RC a,ll tile rcla.tions in TA3ATT which are different from RC. 

It scans TA3ATT completely and for each of the relations 
found to bo different from RC (and any of the previous rolo,-- 
tiono encountered in TABATT so far) it chocks up tn,blos 
COMTT and COTRiilL. Let R1 bo such a relation. If COHATT [Rl^SC] 
is not zero (case (ii)), then it joins to RC (or the relation 
built up so far as a result of previous such relations >.ncoun- 
terod in TABATT) this relation Rl. If it is zero then it goes 
on to checking COIIBEL [El,RCj, If this is not zero then 
(ca,se (iii)) Rl and the relation from COMREL are joined to 
RC (or the relation built so far). If this is also zero 
(case (Iv)) then no actions are taken as it me:ans that the 
query is either very complicated or it is meaningless. A proce- 
dure DI-iMP is then called wiiich .asks the user to split the query 
into simpler sub-queries and run af:ain as a multi-step queryt 
This is always possible to be done if the query is meaningful. 

A listing of procedures CHEOII, BUILD and DUMI? is given in 
Pig . 4.11. 

Now we go on to describing the sequence of relational 
algebra operations to be performed for each of the 8 types of 
condition codes for case (i) only. 
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C NDCD = 1 


i ^ COK'D > ' V i.l n. 0 ' ' ) 


In tl\is c ,so 
brinsjng only those 


RC if) pro^octod D-'r -i-’ie pc aaif-od attri'bute 
•uupleB fo' u-liidi 'A'J <C0'' D> vsluo’ is 


• tis Tiod , 


r'T-^OJOCT 


DC 


nttiibuocs in TilA". 


•?TT' 


AC < ( ‘OlTD > vnlus j 


CFDGD =_ 2 iRC_^A.C * J 

This is convortod to oust the following -- 
I'ROJTSCT TJC BY attri'butos in TABA'!''’’? 


CNDCl ) =_3 (RC^AC =, AlU. 


This is converted to the folloxring thref steps - 
PROJECT RC BY attributes in TABATT and ACj 
PRO TOT RC 31 kOs 

DIVIDE first relation RY second rel-tii-'ni 


C NDCD = & (RC.AC <C0ND> n TO P) 

This is 'Converted to the folloxArin>'^ steps - 

EI’N’D n-th value of 4C frou tonj aid thon like CWDCD = 1, 

PROJECT RC BY attrib.itos in TABjMT YHERE AC <C01'ID> vnlue; 

CEDCD = 5 (RC.A C <C01D> n BOTTOY) 

Here the steps nre same as in CRDOD = 4 except that the 
n— th vnluo from bottom is found, 

CNDCD = 7 (RC.AC OR7.A7) 

Here AC is sotie as A7. This is converted to - 
PROJECT RC BY AC} (giving rolation 1) 

PROJECT B7 BY kli (giving relation 2) 



70 


3U?, TRACT rol-tioa 2 ri?0''l rel-ition I 5 (civing rolptio-. 3) 

JOI^r RC AFB relation Cinally, 

P OJRCT aOovo re3 ' I nttrioutes in TA’BATTj 

£lAppIl=€l /JT-QZAiJl ..( Fp .0 AFR_jrlrtri:m^^ j^S ) <0f~'''3'D > v-luej 

-hi.T oriv'C'-t, d 1 :''> or. .y ono sto'o b "t the pio joctJon 
inv'^lvv.^cl is ’lO rc oomplic-' bod t]v\n orc!in'''j^r projectio)!. The 
stop is - 

PROJECT RO BY attributrs in TABATT YHrRO (total of AS 
for the same o.ttributes ) < C 0 '''''D > value j 

SilPPjfeP, iCpiJHT (rc 3AMB ^a^ttrihut Gs_ tr ibu tes) 

< COFR int eger 

Here 'Iso the projection operation is slightly compli“ 
cated . Th"; step is - 

''hero 

PROJECT RC BY attributes in TA3AT'J’^'( number of tuples with 
the same attributes and the diffe‘‘cnt attributes 
<C01’TD> integer? 


A p 3 rocedure RTMTAl-bCOURRY chechs the CITRCD for each simple 
condition in the condition part and produces suitable steps of 
the object query as explained abov'='. A listing of this proce-' 
dure is given in Pig. 4.14. 

The procedure COFD 'XP whic'’-- chocks the syntax of the 
coranlete condition port (which could be an arbitrary expression 
of simple cendibions) builds up a list structure to store the 
' infix form' of that part. This list is pointed at by pointer G-P: 
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k procedure is written to convo, this infix notation to 
’pro-fix notntim'. This proc:durp is n^nied OlTV'PTfTOPF 
■^nd listing of the snmy is given in Tig. 4.. 12. This prefix 
form is tlfon used to produce ob 3 e"‘t ni^orj statements (operations 
TTiJTOh -^nd IPfrPS’HlOTIOI'' ) by - procedu-’e CliBuFlL - . a listing of 


which is given in Pig. h.l^. This procedure 
the fin'll relations (of e\icli sinnle condition 
relation which, can be outputted after actions j 
table ACTTAB, have been performed on it. 


combine 0 all 
) to produce a 
as stored in the 


The four possible actions required to be taken on the 
final relation are stored in the table ACT TAB and are - 

(i) finding tho total number of tuples in the final relationj 

(ii) doing the total of certain doimain in the final relationj 

(iii) sorting tho fin?il -oolation on some domoinj and 

(iv) dimiting the number of tuples to bo r'^tained in the 
final re"' at ion. 


A flag for each of those is kept in ACTTAJh ''^'e note that if the 
count of the number of tuples is -'■eouired ther none of the 
other three operations are required to be performed. Otherwises 
any combination of the other three operotions could have been 
asked for. The seouence is - totalling, sorting and then 
limit ing. 

A listing of procedure RlTNOUEBj". which performs all 
these actions, is given in Pig. 4.15 for further details. 
Finally procedure OUTBPL, a listing of which is also given in 
Pig. 4.15, outputs the final relation with proper formating. 
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One more thing to be pointed here is that vro have 
built p, numbei of in.terme!3io,to rolntions during tho process 
Oa oxocuting a Query. Each of those rel.itions is to be 
stored ,as soon as it is built so that the further operations 
can aikc use of them. 'IRL’J'' ic p, glob'll varia.ble which 


<^lw\ys points to thr latest }. jli hion built bf the system. 
Each rolfition is p.lso given a namo by the system. This is 
done by .a procedure GITEFAME, a listing of which is given 
in Pig. 4 . 13 . 


In case of a multi-sfccp ouerv, the final relation of 
each sub-query is given the name specified p^s the workspace 
area for that subquery. The entire procedure described so far 
is repeated for each of the sub-queries that fol3-Ow. Only 
difference is that the COMATT and the COFREL tables are 
ffiodi'^'ied and all runtimo action h^.bles are doored before 
starting with each of the subsoouent subquery. The last sub- 
auory ends with a semicolon and then the final relations 
named by the final sub-querj’', is outputted :!fter proper 
formating. A listing of the main p-rogram is given in Fig. 4. 16. 


4.7 Summar y ’ 

In this chapter we have stiown bow the system converts 
the sources query (in the newly specified language) to rela- 
tional algebra operations (which we hove called the object 
query). We have described in details the tables and procedures 
required for the purpose. Listed in Pig. 4.17 are the outpubs 
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of Gf.ch of the expjnplo qaorlos considerv d in Chapter III 
(Qucryl through C'uery9). One nero query lies been, included 
to Pho'" that th,i s;ystem works satisf ''Ctorilv for a multi-sbep 
query also. The word statonent of this query is ■“ 

" uory 10 find the names of those suppliers rho supply at3 cast 
1 ! 

all those jobs thot are supplied to by supplier 
In our language, this will be written as - 
i;'ir:RY10(¥l) 

gMVTH 

CONDITION ; SUPPLIER. SITAMS = 

ACTION ' GET SUPPLY.!# 

. QUERY10(¥) 

CONDITION : Wl.J# = ALL 
ACTION ; GET SUPPLIEB.SNAMS j 

The output of this example query is given in Pig. 4.17. 



OUKHyif 1 
COHOI 

feCTlONsGFK J 1 PART.l^^*AMt;,PRKT.ni 3 H; 


Onery is o.k,, 

fttiSMPi s !!:2£E! Qf^H 

rrunc^ite -’iF-MPl keeoinq only the first 3 tuples. 


.:■ 1 s D I 


a VfF.rPl 


^ '* 1 1 q uroDer forinattina 


PuAi^.F 


HUM 


MijT 

scRt;itf 

CAH 


4 

3 

2 




OUF!PYa(’#i) 

COAJOXTTOMsPAPT.OOH*! ?0P- 
ACTintisaK'i' PART, P#, PART, f>NAMF? 

Find tij** tth Mgftisst value o£ 0OH in HRT 

*fr«MRi s PkojFCT P4PT er f# , .pMAHe- . 

5, » WTPMPi . ' ' .,' /':' - . ^■- ‘.‘ \ ..K-'. 

fill « |«l( « « « f 


$. 


Pf 

«ta N» 

4 


SCREW 


' "-\r^CV 

I ^ i, 1 <■ - ' 1 . J 5 -^ ” 'V 

, ,J/?, ^ “<>'■' ■ 

'* X' A' t J* -V ’ • ' ''r 

;. '-'if' . .-V ■ ■' "' '• '.’.'^'f' - . - - 


T 


* . - >, ' 1 l.'.'x' 

. ■'■i' -/'. 

^ ^ 1|[ 4 ; 4t 4; t ^ « !I|C ^ 4: # 4c « 4t f ft ^ ^ ^ ^ ^ ^ jfs % 3|L4r# 3|<rir 






OUKHV j( / ) 

riu i;SuPPr.'y ,p#a* 

kCfimiG^n ‘^■^^T.jSi'fAMR.PART.vnhCUP PARI.QOH); 

KIEHPI « :'2I;.;£ SUPPLY on PI 

v<Tfc;'^P2 = PfiAMK QOK 

bori" ^TK iPi on dscendlnq OOH 

a 


P-<A.’^F 


COf? 

Cam 

^’UT 

mhT 

SCHfilW 


OOH 


1 

2 

3 




Di)pny»(^> 

fa>5UTTruhscijn.vT(suiPt»ii'if sa«r pi otrr«ii«»f ^#>>2 
AeTin,MJCf:iccnu«x) suppu.pij 

maii'iiUi. « « f 4 )|t 4[ * « « 4! « f % ^ » <|e 41 9ti * « Hi f * ^ i|( ^ ^ f ir # aiic !(! A’ 4= * 

sfTii;Mi»i 3 PROJECT sappLX ex p# , 

Ein4 tnH> nu«ib«r of tapKsft In VtmH »ndr stof® ip CKOUUT)'* 
'■». e WTEMPt * ■ ' .- Y - '■ 


Of tupios In output' relation I* ■ 

\ •, } ,^ , 




to rAI/C SUPPLY SAMg P#,J#sOTY)? 

wt««p} * PWnjiPCT SU5»PLY Bf P# a# OtY 

'in w■l■p:^PJ "do^totdUlna of QTy" wtth sate P# J# 

w = vrFMP2 

.'isoH'/ A witn proper formatting, 


PH 


J « 


OTY 


4 
1 

5 
7 


7 
6 
6 

8 


^ t « 1 1 1 ^ If ^ f « t f « 4c f a !|> « $ f <1. « # ¥ ^ a f >)> a f ^ f f * iK >1: ¥ * a «: a a ’t 


OUFRY^( 4 ) ■'-'=- .» ' 

CUNDTI'IONJ J08,ai»ALL 

»CTin^:GFT SUPPLIER. SWAMC, SUPPLIER, LOCI 

♦♦f 4 i# 4 E!|!*lff» 4 t*’l 4 ' 4 ( 44 c 4 :* 4 t 4 e 4 ! 4 '!|t)!' 4 ! 4 ' 4 t 4 ' 4 c* 4 !* 4 c 4 ' 4 c^** 


^«TEHP1 


JOIN of 

SUPPLIER 

an4 SUPPLY . ‘ 

#4pr«* 

e|i 

«»L, 

Bi- f; 

wtEMpa 

n 

vTOtN of 

.TOR 

an4 wt'gRPi 

Oft 

m 





* \ ' M ^ t 

mW 


'ifgMP.l 

r 

$8 

PgOJKCT 

HtEMP2 

BY SNME \ 

4* m 

l6c 

j# 

>tEMP4 

ss 

PROJCCt 

JOB 

SY J#’’ 

nr^ir 

'•; 


',WjSMP5 

IS 

DIVIDE 

WtgRp3 

Bf-MfEWf# , ' 




M 





= >7TEnP 5 i .. 


S-4A.'4E LuC 

JU*IKS PA*»IS 


'♦f 4 |n|‘ 4 c 4 ' ♦***♦♦♦♦ ', 


PKSt/RE 4 . it EXAMPLE 

• Shr*(n#^ w * 4 * afc w 





JUPPLY Shm P#,J#!0TY)7 

iUffiMiPi ~ HKHaPCT suppi.Y BY PI 

WT'''4<’i 'lo totdlilna ot QtY 

•,>. = ’ ! f':,vip2 

with proper formatting. 


tJi 

with satfe p# 


OTY 


' ' 1 S "t l i'/ 


f>(* 


Jn 


4 
1 

5 
7 


OTY 


7 
6 
6 

8 


;)( * « « « t 4c t « « % « f ^ f « « f « 9^ « $ :<! 1|C !|[ « ](c # it! « 1|( ]|l JfC « 4: Jlc « 4; * f « « ^ iit ^ )|p| f * 


0urHfi'>('-») ’■' •• - 

CUNi:)Ti‘lOM;jaa,J#aALL > ' - ;, 

.^CTin jrGrx Bt1PPLIER.SMAME,SUPPI,IEP.l»0C? 

'Qnerv is n,K,, 


J«TEMP1 

9 

>TaiM of SUPPLIER 

md SUPPLY 

•ilWKlP 

'-m.m 

^ki , ■ ' \ \ ’• 

\i!iTE?«P2 

9 

7UIN Of cfOR 

«n<S HWPl 

WHKWh 

>, ** 


If 

.‘^lsEfE^P4 

% 

9 

PHouECT ^m:m2 

sY smm 

mm * 

LQC 

9 

RROJECt .tOR 

mmmmmm*^ 

BY M' 

nmm 

^ ' 


9 

DIVIDE WTEMP3 

BY WtEMP4 





= WTRMP5 i , 


SN&ME 

m mmm ^ 


LuC 

Wi*«* •* 

PARIS 


I ^ ‘ t - 1. - ^ 11 

: • -V .N :c'' 

FtWE EXAMPLE ouKftt&M' 




ae. 


'’.j-' . M T ! I; SJ'’in-lF,H.S#<>5UPPLy.S» 

Ajri K;,:Fr o^pPi, TER, SUPPLIER, tnc? 

v-jery 1% n,K,, 

4it««t.#4;4cv 4;$ f 


^,TE‘-*P1 
.^Th *k>2 
aTE Ip'^ 

^ rK*^p‘4 

^oTE^PS 


= PkOJECT SUPrLIFR by s# 

= PKnjECT SUPPLY BY S# 

4gn apol npi ml im jm 

= niFFCRPNCli l.F %iTf.MPl AND WTEMP2 

m m tm m m m mm m^mm 

S .TCJl.4 Ot SUPPT-IER and WTEMP3 ©li S# 

W «M ««* <W <«» MR Mk WW 

a PpnjFCT «^TFMP4 gy SSAME iipC 

mmimmi0>in>m mm 

S '^TFMPS 


tsnlny h ^ with pro© 




SUhm , LOC 

MRMeaKMRw ^ mmm 

CI.ftRtC ' LONfiOS 





, • ’!. *4 ' 

FiCUR® 4 ((17 FXA?4^,LE - ,; ^•"’'';>y,"'’;L k".',, 

; , ..^;';:Y:,,.'Y,;;.^<f:.s5t1P«t 



"uj'- ,'y:i ( , ) 

and supply j#s!!'*4* 

ACTiOf.lsr-, S:'fPLrKn,r>}.'A*.iF,SUPPMER.l.OC? 

juery IS i'.K,, 

.*TE"f^ 3 'i2I-«2£ and SUPPLY . on S# 

mm 

i’TL^^p; a «-2iil!;SI WTFMPl by ^sname loc 

ATi:;'-*t’l a 22222SI SUPPLIER BY SKAME L<^ 

«Tt*^P4 S IriTERSECtibJi or MTEMPg ■ and iltE ?<^3 

mrnmmmmmmmimmm «#«i» mmm 

in _ a WTEMP4 ' , . . - - . 

; snamg LOC '• . 

-'1 '**"’***"*p* Awlwaii^ .' 

N AO AMS ATWfEHS 

,5 ' , - V 

?*jrL 

f ’ 

6v» ' 

^''.- ' 
f-\-- 

i^.' 

if: 

t\*'} 'i'" ' I 'i > 


r^ 


'll ^ ■- 
& 


finURK 4,U example . 

IP» tiV'W 4ai t|#i <* 9 «a in i|)r*« fi« 


’■■ f' ' \-A t< 



^ u ^ I* V ^ (* ) 

^0”|-T I’Ti. ' ; 'J jPPMFiJ. i,nc = "ATHl£NS" SUPPLY ,»J#Ss" 4” 

ACTin.'i :{,P’i o>'Pf’LThri,SNRMP,SUPPT,IF:R,tf)C? 

JUery is H.K,, 

* ************* 


,.T£’'P1 

XZ 

vTGiN of SUPPMFR 

antS SUPPLY 

on 

mm 


zz 

pkoject wtrhpi 

BY ^SNAME 

vtWr 

LOC 

#iTc.’’H-l 


PROJECT SUPPLIER 

BY SMAME 

*S» W 

LOC 

«rt^P4 


IMTERSECflOM OF 

W'fEMP2 «n4 

»ltS«F3 


W'reHP4 




Sf^AH?? LOC ' . 

ADAMS AfttENS , ' .V ‘ 

'*■ -^ f «««« )|“|( « <l< « 4c $«» t **;(!««««« «« 4 V * ^ iO * 4^ ^ 4! 


P £<^URE 4.17. RX AMl^V? 'QUSRIBS: 






oup*Kyyfw‘) 

CO*’UlTXnMjPART.CnbfmK"!M,n(.J' HH F&Kf,COLORs’*GRSKN'' 

c T" I n ^ ! i ; R r s ” r o,. i f r . s f , u p ? l x E R , L 0 C ? 

iprv 1 «; , 


. tl: ■'n 

T ,tllT^ of 

SI'PFT.IER 

and SttPPUY 

m^m 

on 

mfm 

s# 

*H' y 

= .Tnl... Of 

PAPT 

and WTkmfi 

on 

p# 





"WP 


i-- TK^’F ^ 

= P HP JECT 

WTFMP2 

BT SNAMF 

LOG 


i- Th;‘'F4 

= .lUlrv Of 

SUPPbiEK 

and snpPLY 

on 

Si 

j* Tb JiF®' 

= JOTw of 

PAR’X 

and wTtMp4 

P9 

on 

Pi 

t.T&MF6 

a PROJECT 

?«fEMPS 

8Y SNAME 

toe 


ftTbHF? 

» IIHION or WtEWp3 
«**«» 

"hm- mmH 

mmm 

, ^ 


#1 a ^fSMl>7 . - • ,_ 

✓ 


SsS^^B 

toe 

>- s 



- 

hiyhms 

JUNKS 

RUAEB 

ATHENS 
PAP IS 
PAP IS 



- 





.fi 


PIGURIC 4,17 mMP^*E OXFEHIBf 

(p» n» «r « wi "^ *» ^ * ***^^^^' 




TUil^v. r Jivjr j. • M 

!i,Fr S’JPPLY.J# 


'<iVyTrTUilSUPPt,IER^5NAHE»‘'fiMTTH« 
.CTlOJti " 




p ,'• 


., r ., M--! 


= ,J«)T., of RunpT Y 
= Pnjf'CT ’'’If'i.Pl 


and supplier on SI 

mmmmtm mm 

BY J# 


= ‘”jf:( 
.u'i-'J i'fu’ I ''' « U ~ 


in ' inE 1 .SHPOLTER.SWRMF,, SUPPLIER, LOC? 


..fc/Vl 


vTJTsv of 

SUPPLIER 

and SUPPLY 

on 

at MM 

SI 

v-TK*'?'! 

zs 

«TuTf4 of 

W 

and WTEMP3 

on 

■PM* 

J* 

v,Th:'’P5 

s 

project 

WTE«P4 

BY SNftME 

mm 

LOC 


#.Tb*'!p6 


PROJECT 

4» «* «•» MW *** 

W 

SY J# 

mm 




a 

DIVIDE 

wTemps 

m if^TEMPS 




a# 


wi 


a WrEMP? 




f 

* ' 
.4' 

iv- 


. 1' . 


S?iA(^E 


SHTIH 

JO^’ER 


LUC 


LONDON 

PARIS 




FIGURE 4.^ EXAMPLE OUERTES 

•t 4r «■ W mm *w ^ ^ Jffl ^ 


, V ‘'"' ‘-‘ .'*- ^'^'' i ' 


■> 



CHAPTER V 


IMPISIIRPTATIOR OP ALGRBEAIC 


-* a\. 


P 


'^'sssiors 


So fo-r '^'fe h-^.vc converted 
sions in the relational o.l^e'brr. 
the following opera.tions - 


the source query into expres- 
Th'so' expressions consist of 


JOIN 

PtOJACT 

DIVIDE 

FIND DIFPEEBNCE 
SORT 

FIND N-TH VAIITE 
COUNT OTETR OF TUPLES 
DO TOTALLING 
AND TRUNCATE 


’/c now give the algorithms for implementing each of 
these opera t-.ions on relations. It has to he pointed out here 
th'^t, at present, no efficient algorithms exist for carrying 
out thos oper'^tions. The efficiency of the existing algorithm, 
depends upon the complexity with which dat.a is stored in the 
system. A more complex storage, involving the maintenance 
of lists, pointers, secondary indexes etc., could result in 
faster .■'Ijorithms for these opercati ns as the access times 
would he consideir.hly reduced. Rut a complex storage structure 
has the disadvantage of making the task of deleting/updating/ 
inserting more complicated. For our algorithms we assume that 
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the relations are stored r.s a list of tuples in the main 
memory of tho computer. All relations > including tho inter- 
mediate relations 5 hence, ha-ve to be searched sequentially. 

The ide? of giving these "Igorithms is ju^’t to show that the 
langua^:? wo have described pioducos correct results. 

5.1 Join; 

This operation is invoked through a call to the proce- 
dure JOIAT (PI, k, R2), where III <and R2 are the numbers of 
the relations to be joined on the common attribute A. The lists 
of tuples in El and R2, pointed at by REIDIR [Blj.RRSTPL and 
RELDIE [R2J.ERSTP1 respectively, are searched seouentially and 
whenever there is a match found for tho va,lues of attribute A 
in a tuple in El and a tuple in 32 then tho two tuples are 
catenated and written into a new tuple which is added to tho 
list of tuples in the new relation being created. Of course, one 
of tho values of the attribute A is omitted in the process. The 
value of IREl, tho pointer to the latest relation .in the 
system, is increased and EELI)IR[ IREl] .ER&TPL is made to point 
to the first tuple in the newly created relation. A listing 
of the procedure JOIN is given in Eig. 5.1- 

5.2 Project/. 

This operation is invoked by a call t'- the procedure 
P'pOJECT(R, Pate) , where R is tho number of the relation to bo 
projected by the attributes whose numbers are in the list 
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point^-Q o"fc by PATH, As describr’d in 4.6, tlae ni'ojcction 
opc^ritin would be of different types depondin" upon the 
condition type (CITDCD). 

For CHDCD = 1,4 end 5y this oper'^tion f/’-ould be? 

P^ OJFCT (attributes point cc" ^t by PATH) HIPHF AC <C01TD> vplue 

For CiTOCP = 2,3 and 7, this oporition would be ; 

ET'PiL^PJ'. 5X. (attributes pointed ot by PATE). 

For CNDCD = 8. this opera i'ion would be ; 

PEOJFCT R BI (attributes point d at by PA.TR) \nBE:B (totsl of 
.attribute TATT8 with sa,ittc SATTS8) <C01TP > TALBI. 

And for CNPCP = 9, this operation '^•''ould bo : 

PROJSCT R 3Y (attributes pointed at by PATE) HT-IE EP (number of 

tuples with sojeae SATTS9 o-nd different DATTS9)<G0RD> VAL9* 

For all the four types of project operations, the list 
of tuples, pointed at by RBLDIR[R] .FRSTPl, is searched sequen- 
tially <and each time tuple is found which satisfies the 
constraint (if any) it is pr'^j'ctcd out into another tuple, 
copying only va.lues of thos^ attributes which are there in the 
list PATR. Each time a new tuple is created, it is apnended 
to tho list of tuples pointed at by RF1DIE[ IRBl] . rESTPL, 
where IREL has hoen incremented in tho beginning of 1iie 
procedure. A procedure TEiTS-^R(TPll,TPL2) is employed to 
transfer the required attribute values from tuple TPIl into 
tuple TP12. A boolean function STSFD(VAll, VAL2) chocks if 
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’ VALl <COND> YAL2' is satisfied. A lisxm,'" of the procedure 
PKOJYCT, procedure THTSFY nid function STcPf is given in 

x^ig. 5.2. 

5.5 ,rhvi_sipn^ 


■'Ohis oporntion is invoiced through n call to the proce-- 
dur" DIVII)‘^(R1 5 P 2 ) , where El is the number of the relation 
which is to bo divided by relation whoso number is R2. In this 
C'^so R2 io a unary relation end relation El has the attribute 
of P2 at the end in its list of attributes. Because thore is 
no duplic 'tion allowed in a relation, a tuple ’X’ will appear 
in thv. output list if and only if ’X' is found in El as many 
tines as there are tuples in S.2. Thus the procedure is similar 
to th-'t for PF0J1CT (for CNDCD=9)s except thet instead of 
VAL9» this procedure uses ][0IIMT1, which is equal to the number 
of tuplos in E2. A listing of the procedure DIVIPR is given in 
Figure 5 . 5 . AS usual lEEL is incremented in the beginning of 
the procedure and EE1DIE[IE5L] .PESTPl points cat tho list of tlio 
tuples in tho newly created relation. 


5.4 


nt hm- Onerations ; 


The remaining operations are rcla.tivoly 


simpler and are 


described belo>^ briefly. 


5.4.1 7ind_ Dif f er enne ; 

This operation is invoked through a call to the procedure 
SIJ3TRACT(P1,E2), where relation E2 is to bo subtracted from 
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rolatiorj El. Both of those rolotions aro unary ancl have tho 
same attribute in then. The lis'c of tuples pointed at by 
RT1LI)IR[ PI] .BRSTPL is travers(.'d through senucntially. For each 
of the tuples in the list, tho list of tuples pointed at by 
RtiLDIR[rt2] .FRSTPl is searched to see if thi^^ tuple in R1 is 
present in E2. In case n, tuple in R1 is present in E2 also 
then the tuple is removed from the list in El. Finally lEFI 
is incremented and EElDIE[IEi'lI] .FRSTPL is made bo point to the 
first tuple of the remaining tuples in El. A listing of the 
procedure SUBTRACT is given in Fig. 5.4. 

5.4.2 Find __ n-th Valu e ; 

This operation is invoked through a co.ll to the procedure 
PINDNTH(R,A,N,PLA(t) , where tho N-th v-^luc (from top if FLAG 
is true* a.nd from bottom if FLAG is false) of attribute A in 
relation R is- to be found. The list of tuples in E is traversed 
through N times and each time the tuplo with the maximum (or 
minimum) value is marked. The tuple which is marked in the end 
contains the value in which wo arc interested. This value is 
copied into the condition table CONTAB[CN] . VALl (integer or 
real as the case may be). Finall3^ th list of tuples is tra-. 
versed once again and all the tuples are unmarked. A listing 
of tho procedure FINDNTH is given in Figure i.4. 
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Tot all ing : 

This operation is invoked through a call to the procodure 
D0T0.4T>(k)j where B is the ■^sl'^tion nunher in which totalling 
is to "b don^ . Th^; list of th oojac attributes is pointed 
°t by ACTTA.B . SATTS8 and the a/atrihuti of '^’'hich the totalling 
has to be done is stored in ACTTAh.TATT. Th'--* list of tuples 
in R is traversed and for tuples which h^ve the sanio va.lues 
of attributes in the same list^ the values of the appropriate 
fisld are summed up. IR.?! is incremented and BRLDIR[IPEI] . RPSTPI 
is made to point at a new tuple which has the attribute values 
of p.t tributes in the same list and the total of the attribute 
ACTTAB.TATT found above. The process is -epeatod for each nex^r 
combination of values of same attributes found in R and new 
tuples are appended to the list in IRBL. A listing of the 
procedure DOTOTAL is given in Rig. 5.5. 

5.4.4 Sor t ; 

t 

This operation is invoked through a call to the procedure 
SORTOUT(R), where R is the relation which is to be sorted as 
per the information stored in ACTTA3.P.S0BT. A simple ripple 
sort methodology is adopted. The procedure sorts the relation 
only on one major field. A listing of the procedure SORTOUT 
is given in Pig. 5.6. 

5 . 4.5 Cou nt the BFumber of T uples ; 

This operation is invoked through a call to the proce- ' 
dure PINDCOOTT(R) ? wh^e S is the relation number of which the 
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nunbcr of tuples is to be found, a vr,-ia'0-3 'lODITT' is initially 
s t to zero. The list of tuples in I?, is traversed and for each 
tuple encountered, the volue of XOUiTT is incrc-n'nted by 1. The 
final vP.luo of KOUNT is the roouired count. A listing of the 
nrocedu-c .?iyj)r! 0 U’'TT is given in Pig. 5.7. 

5 . 4.6 Tri^cjit ey 


This operation is invoked through 0 call to the procedure 
LIi'’ITED( R) , whervj E is the number of the relation in which tlio 
tuples are to be limited to the number stored in ACTTAB.IIfflO. 

A listing of the procedure LIMITED is given in }?ig. 5.7. 

3.4.7 Remo ve Dup l icates ; 

This operation is invoked throu^i a call to the procedure 
T!MVDPLCT(R) , where R is the ro].ation from which all the dupli- 
cate tuples are to be reroovod. The list of tuples is traversed 
sequentially and for each tuple, the remaining of the list is 
searched to find if there exists any more tuples with the same 
values in them. They are remov'ed from the list if they exist. 

A listing of procedure RMYDPLCT is given in Fig. 5 . 7 . 


5.4.8 Union : 

This operation is invoked through a call to the procedure 
UMI 0 H(E 1 ,R 2 ) , where El and R 2 are the two relations to be 
combined. The two relations hav? the same attributes. In this 
the list of tuples in R 2 is appended to the list of tuples in 
R 1 and the procedyre RKVDELCT is- called to remove the duplicate 
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tu'ples froTi the resulting list. IREL is i-ncrsmented and 
P iLDI IPiiL] .PRSlRl is ’Uade to point to the first tuple of 
the now list. A listing cf the procedure UEJOF is given in 
Pig. 5.8. 

5-4.9 I ntersect ; 

This operation is invoked through r call to the prO“ 
ceduro IIWER3ECT(P1, (P2) , whore R1 "^nd R2 ■^re the two relations 
whose intersection has to he found. The two relations have the 
same attributes. The list of tuples in El is tr 3 ,versod and for 
each of th": tuples, the list R2 is searched to find if the 
tuple is present in P2 also. If it is, then the tuple is 
appended to the list of tuples pointed at by PX1IDIR[IRE1] .PPuSTPI 
where IRbL has been incremented in the beginning of the proce- 
dure. A listing of the procedure lETEESECT is given in 
Pig. 5.9. 
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CHA.PTER VI 


CORCLUSIOE 

^ • 1 Su^5iy_ _of_ Done- : 

In this thesis we h-^ve suggested a new aucry language 
for information retrieval from a relatione 1 data base. V/e 
have seen tliat the language is simple and reasonably complete. 
The language is based on thu concept of decision tables and 
does not involve niore difficult concepts of mapping, recursion 
etc. It also does not require one to know 'predicate calculus' 
as does DSl AIPSA. 

’/e have developed a system which processes queries in 
this langup"'©. Ve have first shown how the system converts 
statements of the query to relational algebra ej^pressions and 
then how it applies the algebra operations on the relations 
in the system. A sample data base wa.s cbosen and severe,! 
typos of queries were run on it. The outputs of these have 
been shown. 

The system is general enough in that it can read in any 
new data base and process queries on that date. base. The only 
change required to be made in the system is that the values 
of FOATTS, RORSLS and NORBLSP have to be set to the number of 
attributes in the data base, the original number of relations 
in the data base and -phe total number of relations permitted i 
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tho syston respec tivel^r. Tliis is dons by the COWST declaration! 
in. the beginning of the prograr., Ofcourse, the relations and the 
attributes of tho new d-ta base have to ho defined in files 
RI3LEEF and ATTPSP respGctivcl 3 r. 

The systen has been ex'jensiirely tested and,fron the 
satisfactory results it produces, it c,-n be assjrted with 
reasonable confidence that the new language proeosod in this 
thesis is convertible to algebraic operations which can in turn 
bo applied on the relations in the data base. 

6 . 2 Scope for Improvement s; 

In cose of a complex data base, where two relations can 

% 

be rol.-^ted to each other through more than one relation, tho 
'cable COMBEL could be made three dimensionsJ, storing the list 
of all such common relations in the third dimension. In that 
ense the user must be asked to define his environment before . 
he states his query. During the process of object query genera- 
tion, we look up the common number between the entries in the 
COMREL table and the entries in the array ’EFVI^OITMBDT’ . A 
query would then look as follows - 
QUERY (WOEZSPACB) 

EUYIRONMEUT *. relation names 
dONDITIOH ; condition part 
ACTIOU s action partj 
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Pin,'’.lly, the algorithms for inploaenting tho algohroic 
opv-'r-itions , suggested in Chapter V, can ho mad; ncro efficient 
by storing tho relations as indexed soauential files. In fact; 
much raor~ ’‘'orlc needs to be done to ac’iiove m-'oeimun officioncy 
in the implementation of those basic oper'^tions . (Pef . [12]). 
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